(57) Abstract: The present invention provides bacterial immunogenic agents for administration to humans and non-human animals
to stimulate an immune response. Also provided are methods for vaccination of mammalian species, especially human patients, with
variants of the E. coli FimH protein, said variants being derived from different strains of E. coli, and for the production of antibodies that
protect the vaccine recipient against infection by pathogenic bacterial species. In another aspect the invention provides antibodies
against such proteins and protein complexes that may be used as diagnostics and/or as protective/treatment agents for pathogenic
bacterial species. A plasmid-based method of producing polypeptides, especially fused polypeptides, such as the complex of a 

FIMH ADHESIN PROTEINS AND METHODS OF USE




This application claims the benefit of U.S. provisional application 
60/216,750, filed 7 July 2000, the disclosure of which is hereby incorporated by 
reference in its entirety.



FIELD OF THE INVENTION 


This invention relates to bacterial adhesin proteins and active fragments 
thereof for use in vaccine compositions for the prevention, diagnosis and 
treatment of bacterial induced diseases such as those of the urinary tract, 
especially to the use of such adhesins as immunogenic agents in humans and 
animals to stimulate an immune response.

BACKGROUND OF THE INVENTION 

Urinary tract infections (herein, "UTI") present a disease process that is
mediated by the attachment of bacteria to cells. Escherichia coli is the most 
common pathogen of the urinary tract, accounting for more than 85% of cases 
of asymptomatic bacteriuria, acute cystitis and acute pyelonephritis, as well as 
greater than 60% of recurrent cystitis, and at least 35% of recurrent
pyelonephritis infections. Furthermore, approximately 25%-30% of women
experience a recurrent E. coli urinary tract infection within the first 12 months 
following an initial infection, but after a second or third infection the rate of
recurrence increases to 60%-75%. Given the high incidence, continued
persistence, and significant expense associated with E. coli urinary tract
infections, there is a need for a prophylactic vaccine to reduce susceptibility to 
this disease. 


While many factors contribute to the acquisition and progression of E. coli
urinary tract infections, it is generally accepted that colonization of the urinary
epithelium is a required step in the infection process. In a typical course of E.
coli urinary tract infection, bacteria originate from the bowel, ascend into the
bladder, and adhere to the bladder mucosa where they multiply and establish an
infection (cystitis) before ascending into the ureters and kidneys. Disruption or 
prevention of pilus-mediated attachment of E. coli to urinary epithelia may 
prevent or retard the development of urinary tract infections. In this regard, a 
number of studies have pointed to a role for pili in mediating attachment to host
uroepithelial cells.

To initiate infection, bacterial pathogens must first be able to colonize an
appropriate target tissue of the host. For many pathogens this tissue is located
at a mucosal surface. Colonization begins with the attachment of the bacterium
to receptors expressed by cells forming the lining of the mucosa. Attachment is
mediated via proteins on the bacterium that bind specifically to cellular 
receptors. These proteins, or adhesins, are expressed either directly on the 
surface of the bacterium, or more typically, as components of elongated rod-like 
protein structures called pili, fimbriae or fibrillae. 


Type 1 pili are thought to be important in initiating colonization of the 
bladder and inducing cystitis, whereas P pili are thought to play a role in 
ascending infections and the ensuing pyelonephritis. 

Such pili are heteropolymeric structures that are composed of several
different structural proteins required for pilus assembly. Two types of pili are of
particular interest: P pili and type 1 pili. P pili-carrying bacteria recognize and
bind to the Gal-(α1-4)Gal moiety present in the globoseries of glycolipids on
kidney cells in mammals. Type 1 pili-carrying bacteria recognize and bind to
D-mannose in glycolipids and glycoproteins of bladder epithelial cells.

FimH is the D-mannose-binding adhesin that promotes attachment of
type 1 piliated bacteria to host cells via mannose-containing glycoproteins on
eukaryotic cell surfaces. FimC is its periplasmic chaperone protein. 

In this specification, the terms "pili", "fimbriae," and "fibrillae" are used to 
refer to heteropolymeric protein structures located on the extracellular surface of
bacteria, most commonly gram-negative bacteria. Typically these structures are 
anchored in the outer membrane. Throughout this specification the terms pilus, 
pili, fimbriae, and fibrilla will be used interchangeably. 

As used herein, the term "periplasmic chaperone" is defined as a protein
localized in the periplasm of bacteria that is capable of forming complexes with a 
variety of chaperone-binding proteins via recognition of a common binding 
epitope (or epitopes). Chaperones serve as templates upon which proteins 
exported from the bacterial cell into the periplasm fold into their native
conformations. Association of the chaperone-binding protein with the chaperone
also serves to protect the binding proteins from degradation by proteases 
localized within the periplasm, increases their solubility in aqueous solution, and 
leads to their sequentially correct incorporation into an assembling pilus. 

Chaperone proteins are a class of proteins in gram-negative bacteria that
are involved in the assembly of pili by mediating such assembly, but are not 
incorporated into the structure. FimC is the periplasmic chaperone protein that 
mediates assembly of type 1 pili in bacteria. 

It has recently been reported that such chaperones can direct formation
of the appropriate native structure of the corresponding adhesin or pilin by
inserting a specific fold of the chaperone protein in place of a missing domain or
helical strand of the chaperone or pilin. Thus, FimH proteins tend to have their
native structure in the presence of such a chaperone but not in its absence
[see: Choudhury et al., X-ray Structure of the FimC-FimH Chaperone-Adhesin
Complex from Uropathogenic E. coli, Science 285, 1061 (1999); Sauer et al.,
Structural Basis of Chaperone Function and Pilus Biogenesis, Science 285,
1058 (1999)]. In addition, recent publications have indicated that the required
chaperone strand can be inserted into the adhesin or pilin protein, such as
FimH, to provide the missing structure and produce the correct native structure;
see Barnhart, M.M. et al., PapD-like Chaperones Provide the Missing Information for
Folding of Pilin Proteins, Proc. Natl. Acad. Sci. (USA), 10.1073/pnas.130183897
(published online June 20, 2000).

Antibodies directed against purified whole type 1 or P pili protect against 
cystitis and pyelonephritis, respectively, in both murine and primate models for 
these diseases. See: Abraham et al., Infect. Immun. 48:625 (1985), Roberts et
al., Proc. Natl. Acad. Sci. (USA) 91:11889 (1994), O'Hanley et al., J. Clin. Invest.
75:347 (1985). However, such protection is limited to either homologous E. coli
strains from which the pili used as immunogens were derived, or to a small 
subset of serologically cross-reactive heterologous strains. Therefore, vaccines 
composed predominantly of the major structural proteins of pili (i.e., PapA or 
FimA) appear to be of limited value because antibodies developed against these 
highly variable proteins are specific for the strains used for immunization. 

Furthermore, antibodies to FimH have been found to be protective. 
Barnhart, M.M. et al., PapD-like Chaperones Provide the Missing Information for
Folding of Pilin Proteins, Proc. Natl. Acad. Sci. (USA), 10.1073/pnas.130183897
(published online June 20, 2000). 

Recently, Sokurenko et al. [see: J. Bacteriol. 177, 3680-86 (1995)] have
found that quantitative variations in mannose-sensitive adhesion of E. coli are
due primarily to structural differences in the FimH adhesin. Further research has
shown that the ability of the FimH lectins to interact with monomannosyl
residues strongly correlates with their ability to mediate E. coli adhesion to
uroepithelial cells so that certain phenotypic variants of type 1 fimbriae may
contribute more than others to the virulence of E. coli in the urinary tract
[Sokurenko et al., J. Biol. Chem. 272, 17880-17886 (1997)]. Heretofore, random
point mutations in FimH genes that increase binding of the adhesin to
mono-mannose residues (structures abundant in the oligosaccharide moieties of
urothelial glycoproteins) have been found to confer increased virulence in the
mouse urinary tract. [See: Sokurenko et al., Proc. Natl. Acad. Sci. USA 95,
8922-8926 (1998)]


BRIEF SUMMARY OF THE INVENTION 

In one aspect, the present invention relates to immunogenic 
polypeptides derived from different strains of the bacterium Escherichia coli (E.
coli) which differ from each other in one or more amino acid residues and that
induce immunological responses that lead to protection of an animal, especially 
a human patient, following vaccination with said polypeptides. 

It is an object of the present invention to provide immunogenic FimH
polypeptides useful as components of vaccines for the prevention and treatment
of infections caused by E. coli, in particular, urinary tract infections. In specific
embodiments, the polypeptides of the present invention have the amino acid
sequences shown in Figure 2. Such vaccine compositions comprising the novel
polypeptides disclosed herein are useful in vaccination and treatment of urinary
tract infections, especially those caused by E. coli. In one embodiment, vaccine
compositions according to the invention comprise the polypeptides of Figure 2
(or truncated segments of the sequences of Figure 2 which truncates have
mannose-binding ability and can serve as vaccines), as stabilized structures in
the form of complexes with a chaperone, such as FimC, or with a portion of the
sequence of FimC which portion serves to stabilize the structure of the FimH
adhesin. A particular embodiment employs the consensus sequence of Figure 2
(SEQ ID NO: 55).

WO 02/04496    PCT/US01/21525

It is another object of the present invention to provide antibodies, 
including both polyclonal and monoclonal antibodies, with specificity for the
novel polypeptides disclosed herein and whose amino acid sequences are 
shown in Figure 2. 

It is a still further object of the present invention to provide methods of 
prophylaxis for the prevention of urinary tract infections using the vaccine
compositions disclosed herein. 

It is a yet still further object of the present invention to provide methods of 
treatment of diseases of the urinary tract comprising the use of vaccine 
compositions as disclosed herein and the use of antibodies generated against
such vaccine compositions, and the polypeptides contained therein. 

It is another object of the present invention to provide a novel method of 
preparing polypeptides from recombinant cells using a vector comprising the
plasmid of Figures 3 through 6. In specific embodiments, the polypeptides
comprise the amino acid sequences of Figure 2. In one embodiment, this
process and plasmid can be used to prepare polypeptides that comprise a
bacterial chaperone, such as FimC, fused to a bacterial adhesin, such as FimH
or any of the polypeptides presented in Figure 2, including any consensus
sequence derived therefrom.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the nucleotide sequence encoding a number of FimH
polypeptides disclosed according to the present invention and derived from E.
coli. Figures 1(a) through 1(j) follow in sequence.

Figure 2 shows the amino acid sequences, and sequence variations, for
a number of FimH proteins of different strains of E. coli with the particular strain
listed at the left. Figures 1(a) through 1(c) follow in sequence. A consensus
sequence is presented at the bottom. The sequence for FimH of J96 is known in
the art.

Figure 3 is a diagram showing the construction of the vector designated 
as pCGA101-8. 

Figure 4 is a diagram showing the construction of the vector designated
as pCGA122-30.

Figure 5 is a diagram showing the steps in selection of the final clone 
pCGA139-1-1. 


Figure 6 is a diagram showing the arrangement of genes in the vector 
designated as pCGA139-1-1.


DETAILED DESCRIPTION OF THE INVENTION 

It is an object of the present invention to utilize an immunogenic 
composition for a vaccine (or to produce antibodies for use as a diagnostic or as 
a passive vaccine) comprising novel polypeptides, in particular, FimH
polypeptides or variants thereof, either alone or in complexed form with a
chaperone, or stabilization-inducing portion of a chaperone, or as a series of 
isolated non-contiguous domains, especially mannose-binding domains, linked 
together to form a polypeptide, or polypeptide-like, structure. 


In one embodiment of the present invention, FimH proteins (naturally or 
recombinantly produced, as well as functional analogs) from bacteria that 
produce type 1 pili are contemplated. Even more particularly, E. coli FimH
proteins are contemplated. 

The present invention provides isolated polypeptides comprising a 
sequence selected from the group consisting of at least residues 26 to 186 of
SEQ ID NO: 23 through 45 and 55. 
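
By way of illustration only, the residue range recited above can be taken from a full-length sequence as sketched below. The function name `extract_residues` and the placeholder sequence are hypothetical and are not part of the disclosure; residue numbering is assumed to be 1-based and inclusive, and the actual sequences are those of SEQ ID NO: 23 through 45 and 55.

```python
def extract_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if start < 1 or end > len(sequence) or start > end:
        raise ValueError("residue range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# Placeholder of 200 residues; NOT a real FimH sequence.
placeholder = "ACDEFGHIKLMNPQRSTVWY" * 10

fragment = extract_residues(placeholder, 26, 186)
print(len(fragment))  # 161 residues (186 - 26 + 1)
```

The same helper could be applied to each of the listed sequences in turn, assuming they are available as plain one-letter strings.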

The present invention further relates to isolated polypeptides further 
comprising about the N-terminal two thirds (which includes about residues 1 
through 186) of the sequences selected from SEQ ID NO: 23 through 45 and
55. 

The present invention still further relates to isolated polypeptides 
comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 23 through 45 and 55.

The polypeptides of the present invention may conveniently be present 
in the form of a composition comprising one or more of the polypeptides in 
any desired combination or relative concentrations. Further, the present 
invention includes active truncates, portions, fragments or segments of the
novel polypeptides disclosed herein. 

The present invention also relates to an isolated polynucleotide 
encoding one or more of the polypeptides of the present invention, or active 
truncates, fragments, portions or segments thereof. In one embodiment, the
isolated polynucleotides are shown in Figures 1(a) through 1(j).

A vaccine composition according to the present invention is one 
comprising an immunogenically effective amount of a polypeptide of the 
invention, including immunogenically active truncates, portions, fragments and
segments thereof, and in any and all active combinations thereof, wherein
said polypeptide, or active fragment, or fragments, is/are suspended in a
pharmacologically acceptable carrier, which includes all suitable diluents or
excipients. 

The present invention also relates to antibodies, either polyclonal or 
monoclonal, either recombinant or synthetic, specific for any of the
polypeptides, as disclosed herein (SEQ ID NO: 23 through 45 and 55) 
including immunogenically active truncates, portions, fragments and 
segments of such polypeptides, which antibodies may be used alone or in any 
and all active combinations thereof. 



Such polypeptides, where the entire sequence is used to form a 
vaccine, are commonly structurally stabilized by complexing with a 
chaperone, especially FimC (so as to form a FimCH complex - see: 
Choudhury et al. (1999) and Sauer et al. (1999), the disclosures of which
papers are hereby incorporated by reference in their entirety) or where such
structural stabilization is produced by complementing the structure of the
adhesin with a portion of a chaperone, such as FimC, that induces the
stabilization of the adhesin for use as a vaccine (see Barnhart et al., supra, the
disclosure of which is hereby incorporated by reference in its entirety), or
where the mannose-binding portions of the sequences of the adhesins of
Figure 2 are isolated from the native molecules, or synthesized, and then 
joined together to form a structure comprising 1, 2, 3 or more such domains 
as part of a single polypeptide (other than the native protein) or where such 
domains are held together by some other chemical linkages. 



Because of the immunogenicity and protective properties of the 
polypeptides and fragments disclosed herein, the present invention further 
relates to methods for preventing and/or treating disease in an animal, 
especially a human patient afflicted therewith, comprising administering to 
said animal an amount of a vaccine composition of the present invention that
is effective for such prevention and/or treatment. In specific embodiments of
the present invention, said disease is a urinary tract infection or a bladder
infection. In preferred embodiments, said disease is caused by a bacterium of
the family Enterobacteriaceae, especially E. coli.

The present invention further relates to methods of treating disease in
an animal, especially a human patient afflicted therewith, comprising
administering to said animal a therapeutically effective amount of an antibody
of the invention as disclosed hereinabove. In specific embodiments of the
present invention, said disease is a urinary tract infection or a bladder
infection. In preferred embodiments, said disease is caused by a bacterium of
the family Enterobacteriaceae, especially E. coli.

The present invention also relates to a recombinant cell expressing a 
polypeptide, and fragments thereof, selected from the group consisting of 
SEQ ID NO: 23 through 45, and 55. Here, the term "expressing" means
synthesizing a polypeptide from the disclosed polynucleotides of the 
invention. 

The present invention also relates to a vector comprising a 
polynucleotide selected from the polynucleotides encoding the polypeptides,
and active fragments thereof, of the invention. 

In accordance with the disclosure herein, the present invention also 
includes an immunogenic complex comprising a periplasmic chaperone and a 
polypeptide, or active fragment thereof, selected from the group consisting of
the polypeptides of SEQ ID NO: 23 through 45, and 55. 

A preferred embodiment of such an immunogenic composition is a 
composition wherein the active component of said immunogenic composition is 
a member selected from mannose-binding fragments of FimH adhesin protein
variants, or where said proteins are donor complemented single polypeptide
structures or present in the form of an immunogenic complex comprising
non-contiguous mannose-binding domains derived from said variants (see
Langermann, U.S. Patent Application Serial No. 60/144,016, filed July 15, 1999, 
the disclosure of which is hereby incorporated by reference in its entirety). 

In another aspect of the invention, immunogenic compositions of the
invention may be utilized to produce antibodies to diagnose or treat diseases, 
especially urinary tract infections, or to produce vaccines for prophylaxis and/or 
treatment of such infections as well as booster vaccines to maintain a high titer 
of antibodies against the immunogen(s) of the immunogenic composition. 


Applicant has found that FimH polypeptides are highly conserved 
between various strains of E. coli. Moreover, the amino acid changes that occur
between strains generally occur at similar amino acid positions. 
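
The kind of comparison described above, in which sequences from several strains are lined up and the positions that differ are noted, can be sketched as follows. This is a minimal illustration under stated assumptions: the sequences are assumed to be already aligned to equal length, the function name `consensus_and_variable_positions` is hypothetical, and the three short toy sequences are invented for the example and are not FimH sequences. The consensus is taken here as the most common residue in each column, which is one simple convention and not necessarily how the consensus of Figure 2 was derived.

```python
from collections import Counter


def consensus_and_variable_positions(aligned):
    """Compute a majority-rule consensus and list the variable positions.

    aligned: dict mapping a strain label to a pre-aligned sequence.
    Returns (consensus string, list of 1-based positions that vary).
    """
    sequences = list(aligned.values())
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to equal length")

    consensus = []
    variable = []
    for index in range(length):
        counts = Counter(s[index] for s in sequences)
        consensus.append(counts.most_common(1)[0][0])
        if len(counts) > 1:
            variable.append(index + 1)  # report 1-based positions
    return "".join(consensus), variable


# Toy data only; strain labels and sequences are hypothetical.
toy = {
    "strain_A": "MKRVITLF",
    "strain_B": "MKRVISLF",
    "strain_C": "MKRAITLF",
}
consensus, variable = consensus_and_variable_positions(toy)
print(consensus)  # MKRVITLF
print(variable)   # [4, 6]
```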

As a result of the high conservation of FimH between E. coli strains, FimH
polypeptides from one strain are capable of inducing antibody responses that
inhibit or prevent other E. coli strains from binding to cells by a FimH lectin and/or
provide protection and/or treatment against infection caused by other E. coli
strains. The SEQ ID NOs. for the FimH proteins (including a consensus
sequence) and polynucleotides of the present invention are provided in Table 1
and correlate with the strain designations provided in Figures 1 and 2, and the 
data of Table 2. 

There are various ways in which the polypeptides disclosed according to 
the present invention can be utilized to form therapeutically effective vaccine
compositions for use in the prevention and/or treatment of urinary tract and 
bladder infections, especially those caused by E. coli. 

In accordance with the teachings herein, the present invention is directed 
to an immunogenic composition comprising a purified complex of a periplasmic
chaperone protein, especially FimC, and a FimH variant according to the
invention. The FimC may be obtained from the same E. coli strain as the FimH
or from a different strain. The FimH polypeptide, or portion thereof, is
maintained in the complex in an immunogenic form capable of inducing an
immune response when appropriately introduced into a human patient in need
thereof. As used herein, the term "patient in need thereof" refers to a human that
is infected with, or at risk of being infected with, pathogenic bacteria that
produce pili, especially E. coli and related bacteria. For research purposes, a
mouse model can be utilized to simulate such a patient in some circumstances. 
See Choudhury, et al., supra, and Sauer, et al., supra. 

The polypeptides according to the present invention can also be utilized
as components of therapeutically effective vaccine compositions by forming the
native structures of said FimH variants without the need for a complex with
FimC itself. Thus, the FimH variants as disclosed herein are readily produced by
recombinant methods in such a way as to incorporate therein the sequence of
FimC required for stabilization (see Barnhart et al. (2000)).

An additional means of utilizing the novel polypeptides, or FimH variants,
of the present invention is to form synthetic structures comprising
non-contiguous domains of said variants. It is known that the cell binding portions of
FimH are generally composed of the mannose-binding segments, formed of
about the N-terminal two thirds of the molecule. The remaining pilin-binding
portion is the segment that interacts with FimC to form a complex in the fibrillum
of the bacterial cell. Thus, the FimH variants of the present invention are readily
engineered to produce only the specific, and relatively short, mannose-binding
domains of the N-terminal two thirds of the sequences depicted in Figure 2. All
or a portion of these domains may be strung together using convenient linker
sequences, or other linking structures, to provide polypeptides composed of
non-contiguous mannose binding domains, the overall structure of which
provides a highly immunogenic structure for use in the vaccine compositions
disclosed herein (see Langermann, supra).

Of course, these are only three means of utilizing the FimH variants
disclosed according to the present invention and other useful embodiments will 
suggest themselves to those skilled in the relevant arts from the teachings 
herein. 


The proteins, and immunologically active fragments thereof, of the 
present invention are useful immunogens for preparing vaccine compositions 
that stimulate the production of antibodies that confer immunity to pathogenic 
species of bacteria. Further, preparation of vaccines containing purified proteins 
as antigenic ingredients is well within the level of skill in the art.

The pharmaceutical compositions useful herein also contain a 
pharmaceutically acceptable carrier, including any suitable diluent or
excipient, which includes any pharmaceutical agent that does not itself induce
the production of antibodies harmful to the individual receiving the
composition, and which may be administered without undue toxicity.
Pharmaceutically acceptable carriers include, but are not limited to, liquids
such as water, saline, glycerol and ethanol, and the like, including carriers
useful in forming sprays for nasal and other respiratory tract delivery or for
delivery to the ophthalmic system.

Vaccine compositions may, and usually do, incorporate additional 
substances to stabilize pH, or to function as adjuvants, wetting agents, or 
emulsifying agents, which can serve to improve the effectiveness of the vaccine. 


Vaccines are generally formulated for parenteral administration and are 
injected either subcutaneously or intramuscularly. Such vaccines can also be 
formulated as suppositories or for oral administration, using methods known in 
the art. 


The amount of vaccine sufficient to confer immunity to pathogenic 
bacteria is determined by methods well known to those skilled in the art. This 
quantity will be determined based upon the characteristics of the vaccine
recipient, including considerations of age, sex, and general physical condition,
and the level of immunity required. Where vaccines are administered by
subcutaneous or intramuscular injection, a range of 1 to 500 µg purified protein
may be given.

In addition to use as vaccines, the FimH variants disclosed herein are 
available for use as immunogens to stimulate the production of antibodies for 
use in passive immunotherapy, for use as diagnostic reagents, and for use as 
reagents in other processes such as affinity chromatography.

In one aspect of the invention complexes comprising the E. coli
chaperone FimC and a FimH variant of the invention may be formed by
co-expressing the appropriate FimH variant polypeptide (of a sequence according
to Figure 2) along with FimC, whose amino acid and nucleotide sequences are
known in the art, from a recombinant cell.

In addition, the FimC-FimH complexes useful in vaccines can be 
recovered from the periplasmic spaces of cells of the indicated strains disclosed 
herein. These complexes are found in relatively large amounts in recombinant
E. coli strains which express the FimC protein at levels in excess of those
produced in wild type strains. A suitable recombinant strain is C600/pHJ9205, a
strain in which expression of FimC has been put under control of the arabinose
promoter. Those skilled in the art will recognize that other promoter sequences
that can be regulated easily may also be used. Of course, such cells are readily
engineered to express one or more of the FimH variant polypeptides of the
invention. An extract of periplasm is obtained by exposing the bacteria to
lysozyme in the presence of a hypertonic sucrose solution. FimCH complexes
can also be purified using conventional protein purification methods well known 
in the art. 


In a similar manner, FimH fragments, whether obtained by having E. coli
produce the full-length FimH variant polypeptides of the present invention and
then fragmenting the protein, or by producing the fragment itself recombinantly,
may be isolated by mannose-binding affinity purification. Thus, only fragments
of the FimH protein that retain mannose binding are isolated. Preferably, such
mannose-binding fragments have a label such as a his-tag included and may be
purified by methods such as nickel chromatography.

As used herein, the terms "portion," "segment," and "fragment," when 
used in relation to polypeptides, refer to a continuous sequence of residues,
such as amino acid residues, which sequence forms a subset of a larger
sequence. For example, if a polypeptide were subjected to treatment with any of
the common endopeptidases, such as trypsin or chymotrypsin, the oligopeptides
resulting from such treatment would represent portions, segments or fragments
of the starting polypeptide. When used in relation to polynucleotides, such
terms refer to the products produced by treatment of said polynucleotides with
any of the common endonucleases. 
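
The endopeptidase example above can be illustrated with a simplified in-silico digest. The sketch below assumes the commonly used textbook rule for trypsin (cleavage on the C-terminal side of lysine or arginine, except when the next residue is proline); real digests are less clean, and the function name `trypsin_digest` and the toy sequence are hypothetical. Every piece returned is a continuous run of residues from the starting sequence, which is the sense in which "portion," "segment," and "fragment" are used here.

```python
def trypsin_digest(sequence: str) -> list:
    """Split a one-letter protein sequence after K or R, unless followed by P."""
    fragments = []
    start = 0
    for index, residue in enumerate(sequence):
        next_residue = sequence[index + 1] if index + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            fragments.append(sequence[start:index + 1])
            start = index + 1
    if start < len(sequence):
        # Trailing piece after the last cleavage site.
        fragments.append(sequence[start:])
    return fragments


# Toy sequence only; not a FimH sequence.
toy_sequence = "MKTAYRPLGKWE"
pieces = trypsin_digest(toy_sequence)
print(pieces)  # ['MK', 'TAYRPLGK', 'WE']

# Each fragment is a contiguous subset of the starting polypeptide.
assert all(piece in toy_sequence for piece in pieces)
```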

The polynucleotides encoding the polypeptides of the invention, for 
example, those of Figure 1, may have the coding sequence fused in frame to a
marker sequence which allows for purification of the polypeptides of the present
invention. The marker sequence may be, for example, a hexa-histidine tag
supplied by a pQE-9 vector to provide for purification of the mature polypeptides
fused to the marker in the case of a bacterial host, or, for example, the marker
sequence may be a hemagglutinin (HA) tag when a mammalian host, e.g. COS-7
cells, is used. The HA tag corresponds to an epitope derived from the
influenza hemagglutinin protein (Wilson, I., et al., Cell, 37:767 (1984)). 
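
What "fused in frame" requires can be sketched as a simple check on codon boundaries. The example below is illustrative only: the tag is written as a start codon followed by six CAT (histidine) codons, the coding sequence is an invented toy, and the helper names `codons` and `fuse_in_frame` are hypothetical. It does not reproduce the actual pQE-9 sequence or cloning sites, and the placement of the tag upstream of the coding sequence is an assumption made for the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(dna: str) -> list:
    """Split a DNA string into consecutive three-letter codons."""
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def fuse_in_frame(tag_dna: str, coding_dna: str) -> str:
    """Join a tag and a coding sequence so the reading frame is preserved."""
    tag_dna, coding_dna = tag_dna.upper(), coding_dna.upper()
    if len(tag_dna) % 3 or len(coding_dna) % 3:
        raise ValueError("both sequences must consist of whole codons")
    if any(codon in STOP_CODONS for codon in codons(tag_dna)):
        raise ValueError("tag contains an in-frame stop codon")
    return tag_dna + coding_dna


his_tag = "ATG" + "CAT" * 6   # start codon plus six histidine codons
toy_orf = "AAAGCTGGTTAA"      # toy coding sequence ending in a stop codon

fusion = fuse_in_frame(his_tag, toy_orf)
print(codons(fusion))
```

A tag whose length is not a multiple of three would shift the downstream reading frame, which is why the sketch rejects it.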

The present invention also relates to vectors which include 
polynucleotides encoding one or more of the adhesin proteins of the present
invention, host cells which are genetically engineered with vectors of the
invention and the production of such adhesin proteins and/or chaperone
proteins by recombinant techniques in an isolated and substantially
immunogenically pure form. 

Host cells are genetically engineered (transduced or transformed or 
transfected) with the vectors comprising a polynucleotide encoding a chaperone,
adhesin protein, mannose binding fragment of an adhesin protein, or the like of 
this invention which may be, for example, a cloning vector or an expression 
vector. The vector may be, for example, in the form of a plasmid, a viral particle, 
a phage, etc. The engineered host cells can be cultured in conventional nutrient 
media modified as appropriate for activating promoters, selecting transformants
or amplifying the polynucleotides which encode such polypeptides. The culture 
conditions, such as temperature, pH and the like, are those previously used with 
the host cell selected for expression, and will be apparent to the ordinarily skilled 
artisan. 


Vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; 
baculovirus; yeast plasmids; vectors derived from combinations of plasmids and 
phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, and 
pseudorabies. However, any other vector may be used as long as it is
replicable and viable in the host. Such procedures and others are deemed to
be within the scope of those skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA
synthesis. As representative examples of such promoters, there may be
mentioned: LTR or SV40 promoter, the E. coli lac or trp, the phage lambda PL
promoter and other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses. The expression vector also
contains a ribosome binding site for translation initiation and a transcription
terminator. The vector may also include appropriate sequences for amplifying 
expression. 
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In addition, the expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin resistance 
5 for eukaryotic cell culture, or such as tetracycline or ampicillin or kanamycin 
resistance in E. coli . 

For example, optimal expression of a FimH-C complex has been 
achieved using a newly constructed single vector containing the FimH and FimC 
genes but having the advantage that each gene is under its own separate lac 
promoter. Thus, one lac promoter is 5' with respect to FimC while the second lac 
promoter is 5' to the FimH gene. This plasmid was successfully constructed 
using the common plasmid pUC19 as a background vector [Yanisch-Perron, C., 
Vieira, J. and Messing, J., Gene, 33:103-119 (1985)]. This new plasmid, when 
used to transform the host E. coli strain BL21 [as described in Phillips, T.A., Van 
Bogelen, R.A., and Neidhardt, F.C., J. Bacteriol. 159:283-287 (1984)] and then 
induced using IPTG at the mid-logarithmic stage of growth, gives maximal 
expression of the FimCH complex in the bacterial periplasmic space. This 
material is then extracted and purified by methods well known in the art, 
including those described herein. 

The vector containing the appropriate DNA sequence as hereinabove 
described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the 
proteins. 

As representative examples of appropriate hosts, there may be 
mentioned: bacterial cells, such as E. coli, Streptomyces, Salmonella 
typhimurium; fungal cells, such as yeast; insect cells such as Drosophila S2 and 
Spodoptera Sf9; animal cells such as CHO, COS or Bowes melanoma; 
adenoviruses; plant cells, etc. The selection of an appropriate host is deemed to 
be within the scope of those skilled in the art from the teachings herein. 

More particularly, the present invention also includes recombinant 
constructs comprising one or more of the sequences as broadly described 
above. The constructs comprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a forward or reverse 
orientation. In a preferred aspect of this embodiment, the construct further 
comprises regulatory sequences, including, for example, a promoter, operably 
linked to the sequence. Large numbers of suitable vectors and promoters are 
known to those of skill in the art, and are commercially available. The following 
vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 
(Qiagen, Inc.), pbs, pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, 
pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXT1, 
pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any 
other plasmid or vector may be used as long as it is replicable and viable in 
the host. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol acetyltransferase) vectors or other vectors with selectable 
markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named 
bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. 
Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, 
early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. 
Selection of the appropriate vector and promoter is well within the level of 
ordinary skill in the art. 

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher 
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. 
Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation 
(Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular Biology, (1986)). 

The constructs in host cells can be used in a conventional manner to 
produce the gene product encoded by the recombinant sequence. Alternatively, 
the polypeptides of the invention can be synthetically produced by conventional 
peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts, as well as 
other methods in molecular biology, are described in Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, 
N.Y., (1989), Wu et al., Methods in Gene Biotechnology (CRC Press, New 
York, NY, 1997), and Recombinant Gene Expression Protocols, in Methods in 
Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, NJ, 1997), the 
disclosures of which are hereby incorporated by reference. 

Transcription of the DNA encoding the polypeptides of the present 
invention by higher eukaryotes is increased by inserting an enhancer sequence 
into the vector. Enhancers are cis-acting elements of DNA, usually from about 
10 to 300 bp, that act on a promoter to increase its transcription. Examples 
include the SV40 enhancer on the late side of the replication origin bp 100 to 
270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, 
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and 
a promoter derived from a highly-expressed gene to direct transcription of a 
downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), 
α-factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with 
translation initiation and termination sequences. Optionally, the heterologous 
sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable 
markers and an origin of replication to ensure maintenance of the vector and to, 
if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and 
various species within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors 
for bacterial use can comprise a selectable marker and bacterial origin of 
replication derived from commercially available plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, USA). 
These pBR322 "backbone" sections are combined with an appropriate promoter 
and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, a French 
press, mechanical disruption, or use of cell lysing agents; such methods are well 
known to those skilled in the art. 

Various mammalian cell culture systems can also be employed to 
express recombinant protein. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, 
Cell, 23:175 (1981), and other cell lines capable of expressing a compatible 
vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian 
expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination sequences, and 5' 
flanking nontranscribed sequences. DNA sequences derived from the SV40 
splice, and polyadenylation sites may be used to provide the required 
nontranscribed genetic elements. 

The polypeptides can be recovered and/or purified from recombinant cell 
cultures by well-known protein recovery and purification methods. Such 
methodology may include ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. 
Protein refolding steps can be used, as necessary, in completing configuration 
of the mature protein. In this respect, chaperones may be used in such a 
refolding procedure. Finally, high performance liquid chromatography (HPLC) 
can be employed for final purification steps. 

The polypeptides that are useful as immunogens in the present invention 
may be a naturally purified product, or a product of chemical synthetic 
procedures, or produced by recombinant techniques from a prokaryotic or 
eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention 
may be glycosylated or may be non-glycosylated. 

Procedures for the isolation of a periplasmic chaperone protein 
complexed with an adhesin protein are known in the art; as an example, see 
Jones et al., Proc. Natl. Acad. Sci. (USA) 90:8397-8401 (1993). Further, the 
individually expressed adhesin proteins may be isolated by recombinant 
expression/isolation methods that are well-known in the art. Typical examples 
for such isolation may utilize an antibody to the protein or to a His tag or 
cleavable leader or tail that is expressed as part of the protein structure. 

It is contemplated that the polypeptides of the present invention may be 
in isolated or purified form. 

"Isolated" in the context of the present invention with respect to 
polypeptides (or polynucleotides) means that the material is removed from its 
original environment (e.g., the cells used to recombinantly produce the 
polypeptides disclosed herein). Such peptides could be part of a composition, 
and still be isolated in that such vector or composition is not part of its natural 
environment. The polypeptides and polynucleotides of the present invention are 
preferably provided in an isolated form, and preferably are purified to 
homogeneity. 

The recombinant and/or immunogenic polypeptides, disclosed in 
accordance with the present invention, may also be in "purified" form. The 
term "purified" does not require absolute purity; rather, it is intended as a 
relative definition, and can include preparations that are highly purified or 
preparations that are only partially purified, as those terms are understood by 
those of skill in the relevant art. For example, polypeptides from individual 
clones isolated from a cDNA library have been conventionally purified to 
electrophoretic homogeneity. Purification of starting material or natural 
material to at least one order of magnitude, preferably two or three orders, 
and more preferably four or five orders of magnitude is expressly 
contemplated. Furthermore, claimed polypeptides having a purity of preferably 
0.001%, or at least 0.01% or 0.1%, and even desirably 1% by weight or 
greater is expressly contemplated. 

For purposes of recombinantly producing the polypeptides of the 
invention, the term "expression product" means that polypeptide or protein 
that is the natural translation product of the gene and any nucleic acid 
sequence coding equivalents resulting from genetic code degeneracy and 
thus coding for the same amino acid(s). 

The FimCH polypeptides useful in forming the vaccine compositions of 
the present invention may conveniently be cloned using various cloning 
systems. An example of a useful cloning system for synthesizing FimCH 
molecules that may incorporate any of the FimH sequences disclosed herein 
(see Figure 2) is presented in Example 2 and utilizes a plasmid based cloning 
system. The FimCH complex described therein is a 52 kDa complex 
composed of two proteins, FimC (22.8 kDa) and FimH (29.1 kDa), in 
a 1:1 equimolar ratio. The FimCH complex is expressed from a pUC-based 
vector (pCGA139-1-1) with two separate lac-inducible promoters driving 
expression of the FimC and FimH genes, respectively. The FimC and the 
FimH genes in the pCGA139-1-1 vector were derived from uropathogenic E. 
coli isolate J96. 
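
As a quick arithmetic check on the masses quoted above, the following minimal Python sketch sums the stated subunit masses for a 1:1 complex and derives the share of the mass contributed by FimH. The function and variable names are illustrative only and do not appear in the specification.

```python
# Subunit masses in kDa, as quoted in the text for the J96-derived proteins.
FIMC_KDA = 22.8
FIMH_KDA = 29.1


def complex_mass(masses_kda, copies):
    """Mass (kDa) of a complex from subunit masses and their copy numbers."""
    return sum(mass * n for mass, n in zip(masses_kda, copies))


# 1:1 equimolar FimC:FimH complex.
total_kda = complex_mass([FIMC_KDA, FIMH_KDA], [1, 1])
fimh_mass_fraction = FIMH_KDA / total_kda

print(f"FimCH complex: {total_kda:.1f} kDa")
print(f"FimH mass fraction: {fimh_mass_fraction:.2f}")
```

The sum of 51.9 kDa agrees with the stated figure of about 52 kDa for the complex.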

The FimCH complex is produced in the periplasm of E. coli strain BL21 
and is purified from periplasmic extracts by standard chromatographic 
methods. The FimCH protein has been formulated in a number of different 
buffers compatible with its solubility profile including 20 mM HEPES (pH 7.0), 
PBS (pH 7.0) and sodium citrate (pH 6.0) + 0.2 M NaCl. This sodium 
citrate/sodium chloride formulation enhances the stability of the FimCH 
complex and is also compatible with commonly used diluents. 

Plasmid pCGA139-1-1 was constructed as a means of producing 
relatively large amounts of E. coli chaperone-adhesin complex, FimCH, for 
use in the vaccine compositions disclosed herein. Such vaccines act by 
blocking adherence of E. coli to the mucosa of the urinary tract, thus 
preventing colonization which often results in infection. 

The plasmid vector, pCGA139-1-1, contains the following genetic 
elements: (1) an E. coli FimC chaperone gene followed by (2) the fimH 
adhesin gene, both from E. coli strain J96 (a urinary tract infection (UTI) 
isolate), each preceded by its respective native signal sequence (nss); (3) a 
kanamycin resistance (kanr or kr) marker; (4) lacIq, which codes for a repressor 
protein that binds the lac promoter unless it is induced; (5) an inactivated 
beta-lactamase (bla) gene; (6) pUC origin of replication (ori); and (7) two lac 
promoters, one preceding the fimC signal and the other preceding that of 
fimH. 
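
The layout of the expression cassette listed above can be sketched as a small data structure. This is only an illustration of the stated ordering (promoter, native signal sequence, gene, repeated for fimC and fimH); the class, the `kind` labels, and the helper function are hypothetical, and the positions of the remaining elements on the plasmid are not implied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    name: str
    kind: str  # illustrative labels: "promoter", "signal", "gene", ...


# Cassette order as described in the text: each gene has its own lac promoter
# placed upstream of its native signal sequence (nss).
CASSETTE = [
    Element("lac promoter 1", "promoter"),
    Element("fimC nss", "signal"),
    Element("fimC", "gene"),
    Element("lac promoter 2", "promoter"),
    Element("fimH nss", "signal"),
    Element("fimH", "gene"),
]

# Remaining elements named in the text; their order here carries no meaning.
OTHER_ELEMENTS = [
    Element("kanamycin resistance", "marker"),
    Element("lac repressor gene", "repressor"),
    Element("bla (inactivated)", "inactivated marker"),
    Element("pUC origin of replication", "origin"),
]


def has_dedicated_promoter(cassette, gene_name):
    """True if the gene is directly preceded by a promoter and then a signal."""
    idx = next(i for i, e in enumerate(cassette) if e.name == gene_name)
    return (
        idx >= 2
        and cassette[idx - 2].kind == "promoter"
        and cassette[idx - 1].kind == "signal"
    )


for gene in ("fimC", "fimH"):
    print(gene, has_dedicated_promoter(CASSETTE, gene))
```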

The polypeptides, their fragments or other derivatives, or analogs thereof, 
or cells expressing them can be used as an immunogen to produce antibodies 
thereto. These antibodies can be, for example, polyclonal or monoclonal 
antibodies. The present invention also includes chimeric, single chain, and 
humanized antibodies, as well as Fab fragments, or the product of an Fab 
expression library. Various procedures known in the art may be used for the 
production of such antibodies and fragments. 

Antibodies generated against the variant FimH polypeptides 
corresponding to a sequence of the present invention can be obtained by direct 
injection of the polypeptides into an animal or by administering the polypeptides 
to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides themselves. In this manner, even a sequence encoding only a 
fragment of the polypeptides can be used to generate antibodies binding the 
whole native polypeptides. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples 
include the hybridoma technique (Kohler and Milstein, Nature, 256:495-497 
(1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et 
al., Immunology Today 4:72 (1983)), and the EBV-hybridoma technique to 
produce human monoclonal antibodies (Cole, et al. (1985) in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to 
immunogenic polypeptide products of this invention. Also, transgenic mice may 
be used to express humanized antibodies to immunogenic polypeptide products 
of this invention. 

In order to facilitate understanding of the above description and the 
examples which follow below, certain frequently occurring methods and/or terms 
will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed 
by capital letters and/or numbers. The starting plasmids herein are either 
commercially available, publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with published procedures. In 
addition, equivalent plasmids to those described are known in the art and will be 
apparent to the ordinarily skilled artisan. 

In carrying out the procedures of the present invention it is of course to 
be understood that reference to particular buffers, media, reagents, cells, 
culture conditions and the like are not intended to be limiting, but are to be 
read so as to include all related materials that one of ordinary skill in the art 
would recognize as being of interest or value in the particular context in which 
that discussion is presented. For example, it is often possible to substitute one 
buffer system or culture medium for another and still achieve similar, if not 
identical, results. Those of skill in the art will have sufficient knowledge of 
such systems and methodologies so as to be able, without undue 
experimentation, to make such substitutions as will optimally serve their 
purposes in using the methods and procedures disclosed herein. 

The present invention will now be further described by way of the 
following non-limiting examples. In applying the disclosure of these examples, 
it should be kept clearly in mind that other and different embodiments of the 
methods disclosed according to the present invention will no doubt suggest 
themselves to those of skill in the relevant art. 



EXAMPLE 1 

Passive immunization using the FimH variants of the present invention 
was demonstrated as follows. Anti-serum against FimC and FimCH complexes 
(using strain J96 (SEQ ID NO: 44) as the source of FimH and strain NU14 as 
source of FimC) was generated as two different pools and used for these 
experiments. Mice were passively immunized intraperitoneally with 100 µl each 
of either anti-C or anti-FimCH rabbit sera 24 hours and 4 hours prior to 
inoculation. Endpoint titers for the rabbit sera were determined to be at least 
1:500,000 by ELISA against the respective antigens. 

Bacteria of the respective strains were then collected, washed and 
resuspended in phosphate buffered saline (PBS) and cell concentration adjusted 
to OD = 1.8 (at 600 nm). This suspension was then diluted 1:10 in PBS and 
tested for hemagglutination (HA) with guinea pig erythrocytes. This final 
suspension was used as inoculum and viability was determined on TSA plates. 
Mice were anaesthetized and then inoculated intraurethrally with 50 µl of E. coli 
suspension containing about 3 x 10^7 cfu (colony forming units). Two days 
post-inoculation, the mice were sacrificed and bladders were removed and collected 
into 500 µl PBS supplemented with 1% mannose. The number of CFUs per 
bladder was determined by grinding the bladders with a tissue tearer and then 
diluting and plating the suspension on TSA plates. The mean number of colony 
forming units per bladder was determined and data transformed to log 
CFU/bladder (as reported in Table 2). 
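
The plate-count arithmetic behind a log CFU/bladder value can be sketched as follows. The 500 µl homogenate volume and the 3 x 10^7 cfu in 50 µl inoculum come from the text; the colony count, dilution factor and plated volume are hypothetical, and the text does not say whether the mean was taken before or after the log transform, so taking the mean of per-bladder log10 values here is an assumption.

```python
import math

HOMOGENATE_UL = 500.0  # each bladder was collected into 500 µl PBS (from the text)


def cfu_per_bladder(colonies, dilution_factor, plated_ul, homogenate_ul=HOMOGENATE_UL):
    """Back-calculate total CFU in one bladder homogenate from a single plate count."""
    cfu_per_ul = colonies * dilution_factor / plated_ul
    return cfu_per_ul * homogenate_ul


def mean_log_cfu(cfu_values):
    """Mean of log10(CFU) over bladders; zero counts must be handled by the caller."""
    return sum(math.log10(v) for v in cfu_values) / len(cfu_values)


# Hypothetical plate: 42 colonies from 100 µl of a 1:100 dilution.
example_cfu = cfu_per_bladder(colonies=42, dilution_factor=100, plated_ul=100.0)

# Inoculum density implied by the text: 3 x 10^7 cfu delivered in 50 µl.
inoculum_cfu_per_ml = 3e7 * (1000 / 50)

print(f"CFU per bladder: {example_cfu:.0f}")
print(f"log10 CFU per bladder: {math.log10(example_cfu):.2f}")
print(f"Inoculum: {inoculum_cfu_per_ml:.1e} cfu/ml")
```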




WO 02/04496 



PCT/US01/21525 



Table 1. SEQ ID NOs. of Nucleotides and Amino acids 

Strain       Nucleotide     Polypeptide 
             SEQ ID NO:     SEQ ID NO: 

J96          21             44 
NU14         22             45 
B210         1              23 
B212         2              24 
B217         3              25 
B223         4              26 
B228         5              27 
B238         6              28 
B240         7              29 
B242         8              30 
DS17         9              31 
EC42         10             32 
EC45         11             33 
EC56         12             34 
EC58         13             35 
EC60         14             36 
EC61         15             37 
EC62         16             38 
EC80         17             39 
EC89         18             40 
EC95         19             41 
G162         54             42 
G189         20             43 
Consensus                   55 

Table 2. Passive Protection by FimH Variants 

            Mean Log CFU per Bladder       T-test 
Strain      FimC     FimCH    Naive        C vs. CH      CH vs. Naive 

B223        7.79     5.69     7.58         0.0034        0.0107 
EC45        6.43     4.58     ND           0.0087        ND 
NU14        4.54     2.53     5.22         0.0014        0.0000428 
B217        4.47     3.49     5.17         0.0142        0.0007 
DS17        4.64     3.02     4.45         0.0163        0.0355 
B218        4.30     2.99     4.16         0.0066        0.0331 
B220        4.18     1.93     3.55         0.0000257     0.0016 
EC56        3.02     2.60     3.34         0.5245        0.2222 
EC42        2.47     1.13     2.83         0.0274        0.0013 
J96         2.09     0.96     2.29         0.1005        0.0328 
B212        3.20     2.05     3.20         0.0167        0.443 
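
One way to read Table 2 is as a log reduction in bladder colonization between the anti-FimC and anti-FimCH groups. The sketch below transcribes the table values and computes that difference per strain; the 0.05 significance threshold and all function names are assumptions added for illustration and are not stated in the text.

```python
# Rows transcribed from Table 2:
# strain -> (FimC, FimCH, naive, p-value for C vs. CH). None marks "ND".
TABLE_2 = {
    "B223": (7.79, 5.69, 7.58, 0.0034),
    "EC45": (6.43, 4.58, None, 0.0087),
    "NU14": (4.54, 2.53, 5.22, 0.0014),
    "B217": (4.47, 3.49, 5.17, 0.0142),
    "DS17": (4.64, 3.02, 4.45, 0.0163),
    "B218": (4.30, 2.99, 4.16, 0.0066),
    "B220": (4.18, 1.93, 3.55, 0.0000257),
    "EC56": (3.02, 2.60, 3.34, 0.5245),
    "EC42": (2.47, 1.13, 2.83, 0.0274),
    "J96": (2.09, 0.96, 2.29, 0.1005),
    "B212": (3.20, 2.05, 3.20, 0.0167),
}

ALPHA = 0.05  # assumed threshold, not taken from the text


def log_reduction(strain):
    """Difference in mean log CFU/bladder between anti-FimC and anti-FimCH groups."""
    fimc, fimch, _naive, _p = TABLE_2[strain]
    return fimc - fimch


def fold_reduction(strain):
    """The same difference expressed as a fold change in CFU."""
    return 10 ** log_reduction(strain)


significant = [s for s, row in TABLE_2.items() if row[3] < ALPHA]

for strain in TABLE_2:
    print(f"{strain:5s} log reduction {log_reduction(strain):.2f} "
          f"({fold_reduction(strain):.0f}-fold)")
print("C vs. CH below assumed alpha:", ", ".join(significant))
```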



EXAMPLE 2 

Construction of Plasmid Vector for FimCH Production 

The plasmid vector, pCGA139-1-1, for production of FimCH was 
constructed in several steps. Although this example is directed to the use of 
FimH and FimC from J96, the example is also applicable to the FimH variants of 
the invention. 

Construction of pCGA101-8 

Genomic DNA was prepared from Escherichia coli strain J96. The pellet 
from 1.0 ml of an overnight culture was washed with PBS, resuspended in 500 
µl sterile sucrose Tris EDTA (0.3 molar sucrose: 100 millimolar Tris, pH 8.0: 50 
millimolar EDTA) and 0.01 µg lysozyme was added thereto. The suspension was 
incubated at 37°C for 10 minutes and SDS was added to a final concentration of 
0.5%. The mixture was then treated with RNase for 10 minutes at 37°C after 
which the DNA was extracted with water saturated phenol followed by a 
mixture of chloroform: isoamyl alcohol at a 24:1 ratio v/v and ethanol 
precipitated. The resulting pellet was washed with 70% ethyl alcohol, dried and 
resuspended in a solution containing 10.0 mM Tris and 0.1 mM EDTA. This 
DNA was used as template for PCR amplification of the fimC gene. 

PCR was performed with primers ga1F and ga2R (SEQ ID NO: 56 and 
57, respectively) containing NcoI and BglII restriction sites, respectively. 
Conditions used were: 1 cycle at 95°C for 1.0 min; 25 cycles of amplification with 
strand separation at 95°C for 30 sec, annealing at 50°C for 30 sec and strand 
elongation at 72°C for 2.0 min. This was followed by one 10 min cycle at 72°C to 
ensure complete elongation of all ends. PCR products were purified via Qiagen 
columns and the gene was cloned into the vector pPW19R (construction 
described below), 3' to the PelB leader sequence on the plasmid. The result 
was plasmid pCGA101-8 (shown in Figure 3). 
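
The cycling conditions above can be written down as a small program description and summed to estimate hold time. This is a minimal sketch: the step labels and the `Step` class are illustrative, and the total ignores temperature ramp times, which depend on the instrument.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: int
    seconds: int


# Values taken from the conditions stated in the text.
INITIAL = Step("initial denaturation", 95, 60)
CYCLE = (
    Step("strand separation", 95, 30),
    Step("annealing", 50, 30),
    Step("strand elongation", 72, 120),
)
FINAL = Step("final elongation", 72, 600)
N_CYCLES = 25


def program_seconds(initial, cycle, n_cycles, final):
    """Total hold time of the program, excluding ramping between temperatures."""
    return initial.seconds + n_cycles * sum(s.seconds for s in cycle) + final.seconds


total_s = program_seconds(INITIAL, CYCLE, N_CYCLES, FINAL)
print(f"{total_s} s = {total_s / 60:.0f} min")  # 5160 s = 86 min
```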

Construction of pCGA122-30 

A kanamycin resistance gene was excised as an AlwNI/StyI fragment from 
pET26b(+) (Novagen) and cloned into the unique DraII site in pTTQ18 5' of the 
lacIq, producing pTTQ18K. This plasmid contains the lacIq and kanr genes in 
tandem so they can be cloned as a single cassette (see Figure 4). 

In an effort to obtain optimal yields of FimH, its native signal was used to 
replace the PelB leader sequence. A method utilizing overlapping PCR was 
used. Primary PCR segments were (1) fimH gene and its native signal from 
genomic J96 DNA with primers ga13F and ga6R (SEQ ID NO: 58 and 59, 
respectively) and (2) lac promoter/operator (lac p/o) from pPW19R with primers 
ga11F and ga9R (SEQ ID NO: 60 and 61, respectively). Overlapping PCR 
resulted in a single fragment primed with ga11F and ga6R containing BglII and 
SalI sites, respectively. 

Vector pCGA122-30 was made by ligating (1) the BglII/SalI PCR 
fragment consisting of lac p/o, fimH native signal sequence + fimH; (2) the 
BglII/ScaI fragment from pCGA101-8 containing the beta-lactamase gene, pUC 
ori, lac p/o, pelB leader, and fimC; and (3) the cassette containing lacIq and kanr 
from pTTQ18K as a SalI/ScaI fragment. 

Construction of pCGA139-1 and selection of the final clone 

The PelB signal 5' to fimC was replaced with fimC native signal: primary 
fragments for fimC native signal replacement were derived from (1) genomic J96 
DNA with primers ga24F and ga23R, and (2) lac p/o from pPW19R with primers 
as mentioned previously. FimC and its native signal were the result of 
overlapping PCR obtained as one cassette with primers ga24F and ga2R 
containing AflIII and BglII sites, respectively. The product was cloned as a 
replacement AflIII/EcoRI to pCGA122-30 producing pCGA126-1 (see Figure 5 
and SEQ ID NO: 46). 

The beta-lactamase (bla) gene was inactivated by interruption at the ScaI 
site. This was followed by treatment with an exonuclease (Bal31) and 
subsequent filling in with deoxynucleotide tri-phosphates (dNTPs). The plasmid 
was finally re-ligated resulting in a deletion of about 60 bases and thus forming 
the plasmid pCGA139-1 (see Figures 5 and 6). 

pCGA126-1 was sequenced in its entirety and the deletion at the ScaI 
site (thereby giving rise to the pCGA139-1 plasmid) was confirmed by 
sequencing. 

Plasmid pCGA139-1 was transformed into a BL21 E. coli host strain to 
optimize protein expression. Twenty microliters of BL21 competent cells were 
pipetted into a pre-chilled 1.5 ml polypropylene tube. One microliter of vector 
was added and the mixture was incubated on ice for 5 min. The cells were heat 
shocked in a 42°C water bath for exactly 30 sec followed by incubation on ice 
for 2 min. SOC medium (80 µl) was added followed by incubation at 37°C with 
shaking at 250 rpm for 1 hr. The culture was plated on 2XYT agar containing 50 
µg/ml kanamycin and plates were incubated overnight at 37°C. The recipes for 
terrific medium and SOC broth can be found in standard books on cell culture 
(for example, see Sambrook et al. (1989), supra, at book 3, appendix page A.2.). 

Plasmid preparations and frozen 15-20% glycerol stocks were made from 
individual colonies grown overnight in terrific broth. Candidates were chosen by 
both the absence of the ScaI site and by sensitivity to 50 µg/ml ampicillin. 
Plasmids were further analyzed for production of target protein. pCGA139-1 was 
cloned by plating on 2XYT agar containing 50 µg/ml kanamycin. Six clones were 
analyzed according to their restriction patterns, Western blot analysis, and 
production of FimCH protein. A single colony was grown overnight in broth and 
stored in 15%-20% sterile glycerol in Nunc vials at a temperature of -70°C. 

Overnight cultures were diluted 1:30 in terrific broth containing 50 µg/ml 
kanamycin and grown at 37°C to mid-log phase (about 0.3 at OD600). Fifteen ml 
of each culture was induced with 2.0 mM IPTG and harvested after 3 hr. Several 
1 ml aliquots from each sample were sedimented in an eppendorf centrifuge at 
14,000 rpm for 2 min. Total protein was estimated by BCA assay and 1.0 µg 
total protein of uninduced and induced culture was loaded to two polyacrylamide 
gels for electrophoresis against FimCH standards of known concentration. FimC 
and FimH were assayed on separate gels/membranes. Samples were also 
assayed via ion exchange chromatography for levels of FimCH protein. 
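
The dilution and induction volumes implied above follow from C1*V1 = C2*V2. The sketch below is illustrative only: the 1 M IPTG stock concentration and the 30 ml working volume are assumptions, and a 1:30 dilution is read here as one part overnight culture in thirty parts total.

```python
def stock_volume_ul(final_mm, culture_ml, stock_mm):
    """Volume of stock (µl) giving the target final concentration (C1*V1 = C2*V2).

    The small volume contributed by the stock itself is ignored.
    """
    return final_mm * culture_ml / stock_mm * 1000.0


def dilution_volumes(total_ml, fold):
    """Split a total volume into (inoculum, fresh medium) for a 1:fold dilution."""
    inoculum = total_ml / fold
    return inoculum, total_ml - inoculum


ASSUMED_STOCK_MM = 1000.0  # hypothetical 1 M IPTG stock, not given in the text

# 15 ml of culture induced at 2.0 mM IPTG (values from the text).
iptg_ul = stock_volume_ul(final_mm=2.0, culture_ml=15.0, stock_mm=ASSUMED_STOCK_MM)

# 1:30 dilution of an overnight culture into an assumed 30 ml total volume.
inoculum_ml, broth_ml = dilution_volumes(total_ml=30.0, fold=30)

print(f"IPTG stock to add: {iptg_ul:.0f} µl")
print(f"Overnight culture: {inoculum_ml:.1f} ml, fresh broth: {broth_ml:.1f} ml")
```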

Proteins were transferred to nitrocellulose membranes via Western blot, 
blocked with 2% dried milk and treated with primary polyclonal antibodies to 
FimC or FimH-T3. Membranes were washed 3X (15 min each wash) with PBS 
plus 0.01% Tween 20 after which a donkey anti-rabbit secondary antibody 
conjugated to horseradish peroxidase (HRP) was applied for 1 hr. Membranes 
were washed as described before followed by treatment with an anti-HRP 
detection reagent, ECL, or ECL-plus. Nitrocellulose was finally exposed to x-ray 
films and developed in a M35-A x-omatic processor. 

For ion exchange chromatography of the FimCH product, all samples 
were resuspended in 200 µl of PBS, sonicated for 12 min, and diluted 4-fold with 
PBS. Each sample was centrifuged at 10,000 rpm for 3 min into a 0.45 micron 
spin filter unit and transferred into HPLC microvials for analysis. A Pharmacia 
Mono-S HR 5/5 column (5 mm x 50 mm) was used for the quantification of pilus 
proteins in analyzed samples. Mobile phase A was 20 mM potassium phosphate 
(pH 7.0); mobile phase B was mobile phase A but containing 0.5 M potassium 
chloride. A gradient of 0%-30% B over 20 minutes was run at a flow rate of 1.25 
ml/min. Eluted protein was detected using intrinsic tryptophan fluorescence 
detection (excitation 280 nm, emission 335 nm). A standard curve was 
generated using reference standard material diluted to concentrations from 5.2 
µg/ml to 15.6 µg/ml. The correlation coefficient of the calibration curve was 
>0.995. The concentration of FimCH was determined using regression analysis 
from a standard curve of the area under the product peak. 
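
The quantification step described above amounts to a linear calibration of peak area against concentration, and the gradient to a linear ramp in salt. The following sketch shows both under stated assumptions: the peak areas are invented for illustration, an ordinary least-squares fit is assumed (the text says only "regression analysis"), and the function names are hypothetical.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept, correlation r)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


def concentration_from_area(area, slope, intercept):
    """Invert the calibration line to read a concentration from a peak area."""
    return (area - intercept) / slope


def percent_b(t_min, start=0.0, end=30.0, duration=20.0):
    """%B at time t for the linear 0%-30% B gradient over 20 min (clamped)."""
    t = min(max(t_min, 0.0), duration)
    return start + (end - start) * t / duration


def kcl_molar(t_min, b_kcl_molar=0.5):
    """KCl concentration delivered at time t, given 0.5 M KCl in mobile phase B."""
    return percent_b(t_min) / 100.0 * b_kcl_molar


# Standards spanning the stated 5.2 to 15.6 µg/ml range; areas are hypothetical.
STD_CONC = [5.2, 7.8, 10.4, 13.0, 15.6]
STD_AREA = [104.0, 156.0, 208.0, 260.0, 312.0]

slope, intercept, r = linear_fit(STD_CONC, STD_AREA)
unknown = concentration_from_area(182.0, slope, intercept)

print(f"r = {r:.4f}, unknown = {unknown:.2f} µg/ml")
print(f"At 10 min: {percent_b(10):.0f}% B, {kcl_molar(10):.3f} M KCl")
```

A calibration would be accepted under the criterion in the text only if `r` exceeds 0.995.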

Construction of Vector pPW19R 

This vector was constructed by inserting an intergenic region derived 
from pPW14R as a NotI-NcoI fragment into pPW19 creating an additional 
lac P/O, RBS, and PelB leader. pPW19 represents a modification of the vector 
pPW16, which was made by cloning the coding sequence for the c-myc epitope 
into pPW8 as a NotI-SalI linker. 

pPW8 is a modification of pPW6, which is a derivative of pPW4, the latter 
being engineered by insertion of a 60 bp synthetic linker containing a NotI site, 
a 4D4 epitope sequence, and a termination codon into pPW2 as a BamHI-SalI 
fragment. 

pPW2 was made by cloning the full length gene3 (from pSPLIII (JAS)) 
into pPW1, the latter prepared by cloning the PelB leader sequence (an amino 
terminal sequence directing periplasmic insertion of pectate lyase in E. coli) into 
the vector pUC118. 



Table 3. Primer Sequences used for Vector Construction.

Primer*  Sequence                                   SEQ ID NO:

GA1F     5'-CCTGCCATGGCGGGTGTGGCGCTGGGTGCGA         56
         CCCGCGTGATTTATCCGGCAGGGC-3'
GA2R     5'-GGCGTCGACAGATTCTATTATTCCATTACGC         57
         CCGTC-3'
GA13F    5'-CACACAGGAAACAGCTATGATTGTAATGAAA         58
         ACGAG-3'
GA6R     5'-GGCGTCGACGGATCCTTATTGATAAACAAAA         59
         GTCACGCC-3'
GA11F    5'-CCGAATAAAGATATCACGACAGGTTTCCCG-3'       60
GA9R     5'-CATAGCTGTTTCCTGTGTG-3'                  61
GA24F    5'-TGCTCACATGTTCTTTCCTGCGT-3'              62
GA23R    5'-ACGTTTTTATTACTCATAGCTGTTTCCTGTGTG-3'    63
GA21F    5'-ATGAGTAATAAAAACGTCAATGTAAGGAAA          64
         TCGCAGG-3'

* "F" refers to forward primers and "R" refers to reverse primers.
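
As an illustrative check on the primers in Table 3, the sketch below scans several of them for common restriction enzyme recognition sites and confirms that the reverse primer GA6R is complementary to the 3' end of the fimH coding sequence as it appears in the sequence listing. The helper names are hypothetical, only primers whose printed sequence is unambiguous are included, and the recognition sequences are the standard published ones for each enzyme; nothing here is claimed to be the authors' own design procedure.

```python
# Standard recognition sequences (all palindromic) for four common enzymes.
SITES = {"NcoI": "CCATGG", "SalI": "GTCGAC", "BamHI": "GGATCC", "EcoRV": "GATATC"}

# Primer sequences from Table 3, written 5' to 3' with wrapped lines joined.
PRIMERS = {
    "GA1F": "CCTGCCATGGCGGGTGTGGCGCTGGGTGCGACCCGCGTGATTTATCCGGCAGGGC",
    "GA2R": "GGCGTCGACAGATTCTATTATTCCATTACGCCCGTC",
    "GA6R": "GGCGTCGACGGATCCTTATTGATAAACAAAAGTCACGCC",
    "GA11F": "CCGAATAAAGATATCACGACAGGTTTCCCG",
    "GA9R": "CATAGCTGTTTCCTGTGTG",
}

# Last 24 nucleotides of the fimH coding sequence including the stop codon,
# taken from the end of SEQ ID NO: 4 in the sequence listing.
FIMH_3PRIME_END = "GGCGTGACTTTTGTTTATCAATAA"

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


def sites_in(seq):
    """Return the sorted names of enzymes whose recognition site occurs in seq."""
    return sorted(name for name, site in SITES.items() if site in seq)


# Map each primer to the restriction sites it carries.
primer_sites = {name: sites_in(seq) for name, seq in PRIMERS.items()}

# A reverse primer anneals to the top strand, so its reverse complement
# should read like the end of the gene it amplifies.
ga6r_matches_fimh_end = reverse_complement(PRIMERS["GA6R"]).startswith(FIMH_3PRIME_END)
```

Under these assumptions, GA1F carries an NcoI site, GA2R a SalI site, GA6R both BamHI and SalI sites, and GA11F an EcoRV site, while GA9R carries none of the four.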
The nucleotide sequences corresponding to the structures used in
cloning FimCH were as follows: SEQ ID NO: 47 shows J96 FimC plus a signal
sequence and SEQ ID NO: 48 shows J96 FimH; SEQ ID NO: 49 shows the
sequence for the kanamycin resistance gene; SEQ ID NO: 50 shows the
sequence for lacIq; SEQ ID NO: 51 shows the sequence of Beta-lactamase
(with the deletions described in Example 2 covering residues 285-335), SEQ
ID NO: 52 shows the sequence of the origin of replication, and SEQ ID NO: 53
shows the sequence of Lac p/o (where the two promoter sequences are found
at (1) residues 100 to 215, and (2) residues 990 to 1105 of the plasmid (SEQ ID
NO: 46)).

Thus, the present invention also relates to a new method of preparing
polypeptides from recombinant cells using a vector comprising the plasmid of
Figures 3 through 6. In specific embodiments, the polypeptides comprise the
amino acid sequences of Figure 2. In one embodiment, this process and
plasmid can be used to prepare polypeptides that comprise a bacterial
chaperone, such as FimC, fused to a bacterial adhesin, such as FimH or any of
the polypeptides presented in Figure 2, including any consensus sequence
derived therefrom. The plasmid-based procedure of the invention can be used to
prepare any polypeptides but especially finds use in preparing the chaperone
complexes of the adhesin proteins presented herein. Thus, for example, large
amounts of FimCH, the complex of the periplasmic bacterial chaperone, FimC,
and the bacterial adhesin, FimH, are readily prepared using the processes
disclosed herein.

Numerous modifications and variants of the invention are possible in light
of the above teachings; therefore, within the scope of the appended claims, the
invention may be practiced otherwise than as specifically described.

WHAT IS CLAIMED IS:

1. An immunogenic polypeptide comprising a sequence selected from
the group consisting of residues 26 to 186 of SEQ ID NO: 23 to 45, and 55.

2. The immunogenic polypeptide of claim 1 further comprising about the
N-terminal two thirds of the sequences selected from SEQ ID NO: 23 to 45,
and 55.

3. The immunogenic polypeptide of claim 1 wherein said polypeptide
has an amino acid sequence selected from the group consisting of SEQ ID
NO: 23 to 45, and 55.

4. The immunogenic polypeptide of claim 1 wherein said sequence is
SEQ ID NO: 55.

5. The immunogenic polypeptide of claim 2 wherein said sequence is
SEQ ID NO: 55.

6. The immunogenic polypeptide of claim 3 wherein said sequence is
SEQ ID NO: 55.

7. An isolated polynucleotide encoding a polypeptide of claim 1.

8. An isolated polynucleotide encoding a polypeptide of claim 3.

9. An isolated polynucleotide encoding a polypeptide of claim 6.

10. A vaccine composition comprising an immunogenically effective
amount of an immunogenic polypeptide of claims 1, 2, or 3, wherein said
immunogenic polypeptide is in a pharmacologically acceptable carrier.

11. A vaccine composition comprising an immunogenically effective
amount of an immunogenic polypeptide of claims 4, 5, or 6, wherein said
immunogenic polypeptide is in a pharmacologically acceptable carrier.

12. A vaccine composition comprising an immunogenically effective
amount of an immunogenic polypeptide of claim 6, wherein said
immunogenic polypeptide is in a pharmacologically acceptable carrier.

13. An antibody that binds to a polypeptide having an amino acid
sequence selected from SEQ ID NO: 23 to 45, and 55.

14. The antibody of claim 13 wherein said antibody is a monoclonal
antibody.

15. A process for protecting against an enterobacillus-related disease in
a patient at risk of contracting such disease comprising administering to said
patient an effective amount of the vaccine composition of claim 10, 11 and 12.

16. The method of claim 15 wherein said disease is a urinary tract
infection.

17. The method of claim 15 wherein the disease is a bladder infection.

18. The method of claim 15 wherein the disease is caused by a
bacterium of the family Enterobacteriaceae.

19. The method of claim 18 wherein the bacterium is E. coli.

20. A process for treating an enterobacillus-related disease in a patient
afflicted therewith comprising administering to said patient animal an effective
amount of a vaccine composition of claim 10, 11 or 12.

21. The method of claim 20 wherein said disease is a urinary tract
infection.

22. The method of claim 20 wherein the disease is a bladder infection.

23. The method of claim 20 wherein the disease is caused by a
bacterium of the family Enterobacteriaceae.

24. The method of claim 23 wherein the bacterium is E. coli.

25. A process for treating an enterobacillus-related disease in a patient
afflicted therewith comprising administering to said patient an effective
amount of an antibody of claim 13 or 14.

26. The method of claim 25 wherein said disease is a urinary tract
infection.

27. The method of claim 25 wherein the disease is a bladder infection.

28. The method of claim 25 wherein the disease is caused by a
bacterium of the family Enterobacteriaceae.

29. The method of claim 28 wherein the bacterium is E. coli.

30. A recombinant cell expressing a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO: 23 to 45,
and 55.

31. The recombinant cell of claim 30 wherein the amino acid sequence
is the sequence of SEQ ID NO: 55.

32. A vector comprising a polynucleotide encoding a polypeptide
comprising an amino acid sequence selected from the group consisting of
SEQ ID NO: 23 to 45, and 55.

33. The vector of claim 32 wherein the amino acid sequence is the
sequence of SEQ ID NO: 55.

34. The vector of claim 32 wherein said vector comprises the plasmid
pCGA139-1-1.

35. The vector of claim 33 wherein said vector comprises the plasmid
pCGA139-1-1.

36. A process for producing a polypeptide comprising expressing said
polypeptide from a recombinant cell containing the vector of claim 34.

37. The process of claim 36 wherein said polypeptide comprises an
amino acid sequence selected from the group consisting of SEQ ID NO: 23 to
45, and 55.

38. The process of claim 37 wherein said polypeptide further comprises
a bacterial chaperone fused to a bacterial adhesin.
Figure 2 (a)

             1                                                50
B210.aa      (1)   FACKTANGTAIPIGGGSANVYVXJIAPVVNVGQNLVVDIiSTQIPCHiroyPE
B212.aa      (1)   FACKTANGTAIPIGGGSANVYVin^AVm^6QNI*VVDI>STQIFCHNDYPE
B217.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQIFCHNDYPE
B223.aa      (1)   FACKTANGTAIPIGGGSAlJVYVNLAPAVNVGQNLV^
B228.aa      (1)   FAC^ANGTAIPIGGGSANVYVNIAPAVNVGQl^VVDI^
B238.aa      (1)   FACKTANGTAIPIGGGSANVYVNIAIAVNVGQNLVVDLSTQIFCHNDYPE
B240.aa      (1)   FAOTTANGTAIPIGGGSANVYVNLAPVVNVGQNLVVDLSTQIFCHNDYPE
B242.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQTFCHNDYPB
DS17.aa      (1)   FACKTANGTAIPIGGGSANVYVNIiAPAYOTGQNLVVDLSTQIFCHNDYPE
EC42.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQIFCHNDYPE
EC45.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQIFCHNDYPE
EC56.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPVVNVGQNLVVDLSTQIFCHNDYPE
EC58.aa      (1)   FACKTANGTAIPIGGGSANVYVNIAPAVNVGQNLVVDLSTQIFCTOTYPE
EC60.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPVVNVGQN^
EC61.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVXW3QNLVTO
EC62.aa      (1)   FACKTANGTAIPIGGGSANVYVNIAPAVNVGQNLVVDLSTQIFCHNDYPE
EC80.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQIFCHNDYPE
EC89.aa      (1)   FACKTANGTAIPIGGGSANVYVinjAPAVNVGQNLVVDLSTQIFCHNDYPE
EC95.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVOTGQNLWDLSTQIFCHNDYPE
G189.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLV^
J96.aa       (1)   FACKTANGTAIPIGGGSANVYWLAPVVNVGQNLVVDLSTQIFCHNDYPE
NU14.aa      (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVTO^
Consensus    (1)   FACKTANGTAIPIGGGSANVYVNLAPAVNVGQNLVVDLSTQIFCHNDYPE
B210.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGIVKYSGSSYPFPTTSETPRVVYNSRTD
B212.aa      (51)  TITDYVTLQRGSAYGGVLSSFSGIVKYNGSSYPFPTTSETPRVVYNSRTD
B217.aa      (51)  TITDYVTLQRGAAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
B223.aa      (51)  TITDYVTLQRGAAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
B228.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
B238.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
B240.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
B242.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
DS17.aa      (51)  TITDYVTLQRGSAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
EC42.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
EC45.aa      (51)  TITDYVTLQRGAAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
EC56.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
EC58.aa      (51)  TITDYVTLQRGSAYGSVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
EC60.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
EC61.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVEYSGSSYPFPTTSETPRVVYNSRTD
EC62.aa      (51)  TITDYVTLQRGSAYGGVLSHFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
EC80.aa      (51)  TITDYVTLQRGSAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
EC89.aa      (51)  TITDYVTLQRGSAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
EC95.aa      (51)  TITDYVTLQRGSAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
G189.aa      (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
J96.aa       (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
NU14.aa      (51)  TITDYVTLQRGAAYGGVLSSFSGTVKYNGSSYPFPTTSETPRVVYNSRTD
Consensus    (51)  TITDYVTLQRGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTD
Figure 2 (b)
KPWPVALYLTPVSSAGGVAIKAGSLIAVLII^QTNNYNSDDFQFVWNIYA

KPWPVAL YLTPVS saggvai ragsli avlilrqtnnynsddfqfvwniya 

KPWPVAIiYLTPVSSAGGVAlKAGSLIAVLILRQTNNYNSDDFQFVWTOYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPV7^YLTPVSSAGGWIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGLVIKAGSLIAVLILRQTONYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYWSDDFQFVWNIYA 
KPWPVALYLTPVS SAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYIiTPVS SAGGWIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNOTNSDDFQFVWNIYA 
KPWPVAIiYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFWNIY^ 
KPWPVALYLTPVSSAGGVVIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVS SAGGVAI KAGSLIAVLILRQTNNYN SDDFQFVWNIYA 
KPWPVALYLTPVS SAGGVAIKAGSI^VLILRQTlSnmiSDDFQFVWNIYA 
KPWPVALYIiTPVS SAGGVAI KAGSLI AVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYXTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWP VALYLTL VS SAGGVAI KAG S LIAVL ILRQTNNYNSDDFQFVWNI YA 
KPWPVALYLTPVS SAGGVAI KAGSLIAVLILRQTKNYNSDDFQFVWNT YA 
KPWPVALYLTPVS SAGGVAI KAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVTU^YLTPVSSAGGVMKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
KPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNSDDFQFVWNIYA 
151 200
NNDVWPTGG CDA S ARDVTVTLPD YRGS VPI PLTVYCAKS QNLGYYL SGT 
NNDVVVPTGGOTASARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
JSnroVWPTGGOTVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVWPTGGCDVSAHDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVVVPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVVVPTGGCDVSARDVTVTLPDYRGS VPI PLTVYCAKSQNLGYYLSGT 
NNDVWPTGGCDVSAHDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NITOWVPTGGCDASARDVTVTLPDYRGSVPIPLTW 
NNDVVVPTGGCDVSARD VTVTL PDYPGSVPI PLTVYCAKSQNLGYYLSGT 
N2TOVVVPTGGCDVS ARDVTVTLPDYPGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTV^ 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NHDWVPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVVVPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGY 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NJSTDVVVPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
N1TOVVVPTGGCDVSARDVTVTLPDYRGSVPIPLTVYCAKSQNLGYYLSCT 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
NNDVVVPTGGCTVSARDVTVTLPDYRGSVPIPLTWCAKSQ 
NNDVWPTGGCX>VSARDVT\miPDYPGSVPIPLTVYCAKSQNLGYYLSGT 
NNDVVVPTGGCTVSARDVTVTLPDYRGSVPIPLTWOVKSQNLGYYLSGT 
NNDVWPTGGCDVSARDVTVTLPDYRGSVPI PLTVYCAKSQNLGYYLSGT 
Figure 2(c)
201 250
HADAGNSIFTNTASFSPAQGVGVQLARNGTVIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPTNNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNOTSLGAVGTSAVSL
TADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
TADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
TADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPA1«NTVSLGAVCTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTINPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTANGTIVPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTANGTIVPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGTVGTSAVSL
TADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL
HADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVSL

251 279
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGATFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVRSIIAVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
GLTANYARTGGQVTAGNVQSIIGVTFVYQ
[Figure: plasmid construction scheme. Step 1: Construction of pCGA101-8. Step 2: Construction of pCGA122-30. Step 3: Selection of final clone.]
[Figure: plasmid map. Legible labels: lacP/O, fimC, fimH, AflIII (4664), AvaII (553), BamHI (2010).]

SEQUENCE LISTING



<110> Langermann, Solomon
Revel, Andrew
Auguste, Christine
Burlein, Jeanne

<120> FimH Adhesin Proteins and Methods of Use 

<130> 469201-549 

<150> US/60/216,750 
<151> 2000-07-07 

<160> 64 

<170> PatentIn version 3.0

<210> 1 

<211> 837 

<212> DNA 

<213> E. coli

<400> 1 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga tcgtaaaata tagtggcagt 240 

agctatcctt tccctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggggg agtggcgatt 360 

aaagcaggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

ggtttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgctt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcaggtg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttggcgcg caacggtacg gttattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 2 

<211> 837 

<212> DNA 

<213> E. coli

<400> 2 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttacccggaa accattacag actatgtcac actgcaacga 180 

ggttcggctt atggcggcgt gttatctagt ttttccggga tcgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggggg agtggcgatt 360 

aaagcaggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480

tgcgatgctt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 

<210> 3 
<211> 837 
<212> DNA 
<213> E. coli 

<400> 3 

ttcgcctgta aaaccgccaa tggtacagct atccctattg gcggtggcag cgctaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttccgaccac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660 

gtcggcgttc agttgacgcg caacggtacg attattccca cgaataacac ggtatcgtta 720 

ggagcagtac ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837

<210> 4 
<211> 840 
<212> DNA 
<213> E. coli 

<400> 4 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 

ggtgcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaataa 840

<210> 5
<211> 840
<212> DNA 
<213> E. coli 

<400> 5 

ttcgcctgta aaaccgccaa tggtaccgct attcctattg gcggtggcag cgctaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttccgactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggtgg ggtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 

tgcgatgttt ctgctcatga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaataa 840 

<210> 6 
<211> 837 
<212> DNA 
<213> E. coli 

<400> 6 

ttcgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaaca ttgcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttacccggaa accattacag attatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttccgaccac cagtgaaacg ccgcgggttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggtgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattgccg tgacttttgt ttatcaa 837 

<210> 7 
<211> 837 
<212> DNA 
<213> E. coli 

<400> 7 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg gttggtgatt 360
aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420
gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480
tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccgtgg ttcagtgcca 540
attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600
accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660
gtcggcgtac agttgacgcg caacggtacg attattccaa cgaataacac ggtatcgtta 720
ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780
gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 8 

<211> 837 

<212> DNA 

<213> E. coli



<400> 8 

tttgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60
tatgtaaact tggcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120
caaacctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180
ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240
agctatccat ttccgactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300
aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggtgg ggtggcgatt 360
aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420
gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480
tgcgatgttt ctgctcatga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540
attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600
accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660
gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720
ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattacgc acgtaccgga 780
gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 9 

<211> 837 

<212> DNA 

<213> E. coli



<400> 9 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60
tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120
caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180
ggttcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240
agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300
aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360
aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420
gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480
tgtgatgctt ctgctcgtga tgtcaccgtt actttgccgg actaccctgg ttcagtgccg 540
attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct atccggcaca 600
accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660
gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720
ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780
gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837

<210> 10
<211> 840 
<212> DNA 
<213> E. coli

<400> 10 

ttcgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttacccggaa accattacag attatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttccgaccac cagtgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggtgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaataa 840 



<210> 11 

<211> 837 

<212> DNA 

<213> E. coli 

<400> 11 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 

ggtgcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837

<210> 12 
<211> 840 
<212> DNA 
<213> E. coli 

<400> 12 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattatgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaataa 840 



<210> 13 
<211> 837 
<212> DNA 
<213> E. coli

<400> 13 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 

ggttcggctt atggcagcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgtgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct atccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 14 
<211> 837 
<212> DNA 
<213> E. coli 

<400> 14 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 

ggttcggctt atggcagcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgtgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct atccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 15 

<211> 837 

<212> DNA 

<213> E. coli



<400> 15 

ttcgcctata aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60
tatgtaaacc ttgcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120
caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180
ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtagaata tagtggcagt 240
agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300
aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360
aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420
gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480
tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540
attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600
accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660
gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcttta 720
ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattatgc acgtaccgga 780
gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 16 

<211> 837 

<212> DNA 

<213> E. coli



<400> 16 

atcgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcccgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120

caaatctttt gccataacga ttacccggaa accattacag actatgtcac actgcaacga 180

ggttcggctt atggcggcgt gttatctcat ttttccggga ccgtaaaata tagtggcagt 240

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggtgg ggtggcgatt 360

aaggctggct cattaatggc tgtgctaatt ttgcgacaga ccaataacta taacagcgat 420

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480

tgtgatgttt ctgctcgtga tgtcaccgtt actctgccag actaccctgg ttcagtgccg 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660

gtcggcgtac agttaacgcg caacggtacg attaatccag cgaataacac ggtatcgtta 720

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 17 

<211> 837 

<212> DNA 

<213> E. coli 



<400> 17 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccgtgg ttcagtgcca 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

cacgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattatgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 18 

<211> 837 

<212> DNA 

<213> E. coli



<400> 18 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 

caaatctttt gccataacga ttacccggaa accattacag actatgtcac actgcaacga 180 

ggttcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttggcgcg caacggtacg gttattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 19 

<211> 837 

<212> DNA 

<213> E. coli 



<400> 19 

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180

ggttcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg ctggtgagca gtgcgggggg agtggcgatt 360

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480

tgtgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct atccggcaca 600

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 20 

<211> 837 

<212> DNA 

<213> E. coli 



<400> 20 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgctaatgtt 60 

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 

agctatccat ttccgaccac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaaaaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtagtgcc tactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggaacagtag gaacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccggc 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 21 

<211> 837 

<212> DNA 

<213> E. coli



<400> 21 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720

ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattatgc acgtaccgga 780

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 22 

<211> 837 

<212> DNA 

<213> E. coli

<400> 22

ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 

caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 

ggtgcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 

agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 



<210> 23 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 23 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Ile Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Ala Arg Asn Gly Thr Val Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 24 

<211> 279 

<212> PRT 

<213> E. coli

<400> 24

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Ile Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 25 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 25 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ala Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 26 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 26 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ala Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 27

<211> 279

<212> PRT

<213> E. coli

<400> 27

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala His Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 28

<211> 279

<212> PRT

<213> E. coli

<400> 28

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Ile Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Val Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Ala Thr Phe Val Tyr Gln
275



<210> 29

<211> 279

<212> PRT

<213> E. coli

<400> 29

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Leu Val Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Thr Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 30 

<211> 280 

<212> PRT 

<213> E. coli 

<400> 30 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Thr Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala His Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln Glx
275 280



<210> 31 

<211> 279 

<212> PRT 

<213> E. coli

<400> 31

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 32 

<211> 279 

<212> PRT 

<213> E. coli

<400> 32

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Val Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Pro
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr Thr Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 33

<211> 279

<212> PRT

<213> E. coli

<400> 33

Phe Ala Cys Lys
1 10 15

Ser Ala Asn Val
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ala Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Pro
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr Thr Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 34 

<211> 279 

<212> PRT 

<213> E. coli

<400> 34

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Pro
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr Thr Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 35

<211> 279

<212> PRT

<213> E. coli

<400> 35

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Ser Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 36 

<211> 279 

<212> PRT 

<213> E. coli

<400> 36

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Val Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Arg Ser Ile Ile
260 265 270

275


<210> 37

<211> 279

<212> PRT

<213> E. coli

<400> 37

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Glu Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 38

<211> 279 

<212> PRT 

<213> E. coli 

<400> 38 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser His Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Met Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Asn Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 39 

<211> 279 

<212> PRT 

<213> E. coli

<400> 39

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Val Arg Asp Val Thr Val Ile Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Lys Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275



<210> 40

<211> 279

<212> PRT

<213> E. coli

<400> 40

Phe Ala Cys Lys
1 10 15

Ser Ala Asn Val
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Ala Asn Gly Thr Ile Val Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln



<210> 41 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 41 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Leu Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln



<210> 42 

<211> 279 

<212> PRT 

<213> E. coli

<400> 42

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln



<210> 43 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 43 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Lys Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Thr Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln



<210> 44 

<211> 279 

<212> PRT 

<213> E. coli 

<400> 44 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Pro
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr Thr Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 45 

<211> 279 

<212> PRT 

<213> E. coli

<400> 45

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ala Ala Tyr
50 55 60

Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val Lys Tyr Asn Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln
275

<210> 46 

<211> 7416 

<212> DNA 

<213> Artificial 

<220>
<223> Sequence of Plasmid pCGA126-1
<400> 46 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 60 

cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 120 

cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 

tgtgagcgga taacaatttc acacaggaaa cagctatgag taataaaaac gtcaatgtaa 240 

ggaaatcgca ggaaataaca ttctgcttgc tggcaggtat cctgatgttc atggcaatga 300 

tggttgccgg acgcgctgaa gcgggagtgg ccttaggtgc gactcgcgta atttatccgg 360 

cagggcaaaa acaagtgcaa cttgccgtga caaataatga tgaaaatagt acctatttaa 420 

ttcaatcatg ggtggaaaat gccgatggtg taaaggatgg tcgttttatc gtgacgcctc 480 

ctctgtttgc gatgaaggga aaaaaagaga ataccttacg tattcttgat gcaacaaata 540 

accaattgcc acaggaccgg gaaagtttat tctggatgaa cgttaaagcg attccgtcaa 600 

tggataaatc aaaattgact gagaatacgc tacagctcgc aattatcagc cgcattaaac 660 

tgtactatcg cccggctaaa ttagcgttgc cacccgatca ggccgcagaa aaattaagat 720 

ttcgtcgtag cgcgaattct ctgacgctga ttaacccgac accctattac ctgacggtaa 780 

cagagttgaa tgccggaacc cgggttcttg aaaatgcatt ggtgcctcca atgggcgaaa 840 

gcacggttaa attgccttct gatgcaggaa gcaatattac ttaccgaaca ataaatgatt 900 

atggcgcact tacccccaaa atgacgggcg taatggaata atagatctca cgacaggttt 960 

cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag 1020 

gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga 1080 

taacaatttc acacaggaaa cagctatgaa acgagttatt accctgtttg ctgtactgct 1140 

gatgggctgg tcggtaaatg cctggtcatt cgcctgtaaa accgccaatg gtaccgctat 1200 

ccctattggc ggtggcagcg ccaatgttta tgtaaacctt gcgcccgtcg tgaatgtggg 1260 

gcaaaacctg gtcgtggatc tttcgacgca aatcttttgc cataacgatt atccggaaac 1320 

cattacagac tatgtcacac tgcaacgagg ctcggcttat ggcggcgtgt tatctaattt 1380 

ttccgggacc gtaaaatata gtggcagtag ctatccattt cctaccacca gcgaaacgcc 1440 

gcgcgttgtt tataattcga gaacggataa gccgtggccg gtggcgcttt atttgacgcc 1500 

tgtgagcagt gcgggcgggg tggcgattaa agctggctca ttaattgccg tgcttatttt 1560 

gcgacagacc aacaactata acagcgatga tttccagttt gtgtggaata tttacgccaa 1620 

taatgatgtg gtggtgccta ctggcggctg cgatgtttct gctcgtgatg tcaccgttac 1680 

tctgccggac taccctggtt cagtgccaat tcctcttacc gtttattgtg cgaaaagcca 1740 

aaacctgggg tattacctct ccggcacaac cgcagatgcg ggcaactcga ttttcaccaa 1800 

taccgcgtcg ttttcacctg cacagggcgt cggcgtacag ttgacgcgca acggtacgat 1860 

tattccagcg aataacacgg tatcgttagg agcagtaggg acttcggcgg tgagtctggg 1920 

attaacggca aattatgcac gtaccggagg gcaggtgact gcagggaatg tgcaatcgat 1980 

tattggcgtg acttttgttt atcaataagg atccgtcgac ctgcaggcat gcaagcttgg 2040 

cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc 2100 

gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc 2160 

gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg tattttctcc 2220 

ttacgcatct gtgcggtatt tcacaccgca taaattccct gttttggcgg atgagagaag 2280 

attttcagcc tgatacagat taaatcagaa cgcagaagcg gtctgataaa acagaatttg 2340 

cctggcggca gtagcgcggt ggtcccacct gaccccatgc cgaactcaag aagtgaaacg 2400 

ccgtagcgcc gatggtagtg ttggggtctc cccatgcgag agtagggaac tgccaggcat 2460 

caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg 2520 

gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa 2580 

cggcccgggg tgggcgggca ggacgcccgc catatactgc caggcatcaa attaagcata 2640 

aggccatctg acgtatggcc tttttgcgtt tttacaacct cttccgtcca cctgacccca 2700 

tgccgaactc aaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag 2760 

agtagggaac tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc 2820 

gttttatctg ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg cccgggagcg 2880 

gatttgaacg ttgcgaagca acggcccgga gggtggcggg caggacgccc gccataaact 2940 

gccaggcatc aaattaagca gaaggccatc ctgacggatg gcctttttgc gtttctacaa 3000 

actcttcctg tcgtcatatc tacaagccat ccccccacag atacggtaaa ctagcctcgt 3060 
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ttttgcatca ggaaagcagg gaatttatgg tgcactctca gtacaatctg ctctgatgcc 3120 

gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 3180 

ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 3240 

aggttttcac cgtcatcacc gaaacgcgcg agacgaaagg gccatgaaca ataaaactgt 33 00 

ctgcttacat aaacagtaat acaaggggtg ttatgagcca tattcaacgg gaaacgtctt 3360 

ctctaggccg cgattaaatt ccaacatgga tgctgattta tatgggtata aatgggctcg 3420 

cgataatgtc gggcaatcag gtgcgacaat ctatcgattg tatgggaagc ccgatgcgcc 3480 

agagttgttt ctgaaacatg gcacaggtag cgctgccaat gatgttacag atgagatggt 3540 

cagactaaac tggctgacgg aatttatgcc tcttcgacca tcaaccattt tatccgtact 3600 

cctgatgatg catggttact caccactgcg atccccggaa aacagcattc caggtattag 3660 

aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 3720

tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 3780 

aggcgcaatc accaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 3840 

atggctggcc tgttgaacaa gtctggaaag aaatgcataa acttttgcca ttctcaccgg 3900 

attcagtcgt cactcatggt gatttctcac ttaataacct tatttttgac gaggggaaat 3960 

taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 4020 

tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 4080 

atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 4140 

tctaagaatt aattcatggc cctcgtgata cgcctatttt tataggttaa tgtcatgata 4200 

ataatggttt cttagacgtg aggttctgta cccgacacca tcgaatggcg caaaaccttt 4260 

cgcggtatgg catgatagcg cccggaagag agtcaattca gggtggtgaa tgtgaaacca 4320 

gtaacgttat acgatgtcgc agagtatgcc ggtgtctctt atcagaccgt ttcccgcgtg 4380 

gtgaaccagg ccagccacgt ttctgcgaaa acgcgggaaa aagtggaagc ggcgatggcg 4440 

gagctgaatt acattcccaa ccgcgtggca caacaactgg cgggcaaaca gtcgttgctg 4500 

attggcgttg ccacctccag tctggccctg cacgcgccgt cgcaaattgt cgcggcgatt 4560 

aaatctcgcg ccgatcaact gggtgccagc gtggtggtgt cgatggtaga acgaagcggc 4620 

gtcgaagcct gtaaagcggc ggtgcacaat cttctcgcgc aacgcgtcag tgggctgatc 4680 

attaactatc cgctggatga ccaggatgcc attgctgtgg aagctgcctg cactaatgtt 4740 

ccggcgttat ttcttgatgt ctctgaccag acacccatca acagtattat tttctcccat 4800 

gaagacggta cgcgactggg cgtggagcat ctggtcgcat tgggtcacca gcaaatcgcg 4860

ctgttagcgg gcccattaag ttctgtctcg gcgcgtctgc gtctggctgg ctggcataaa 4920

tatctcactc gcaatcaaat tcagccgata gcggaacggg aaggcgactg gagtgccatg 4980

tccggttttc aacaaaccat gcaaatgctg aatgagggca tcgttcccac tgcgatgctg 5040

gttgccaacg atcagatggc gctgggcgca atgcgcgcca ttaccgagtc cgggctgcgc 5100

gttggtgcgg atatctcggt agtgggatac gacgataccg aagacagctc atgttatatc 5160

ccgccgttaa ccaccatcaa acaggatttt cgcctgctgg ggcaaaccag cgtggaccgc 5220

ttgctgcaac tctctcaggg ccaggcggtg aagggcaatc agctgttgcc cgtctcactg 5280

gtgaaaagaa aaaccaccct ggcgcccaat acgcaaaccg cctctccccg cgcgttggcc 5340

gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 5400

cgcaattaat ggtaagttag ctcactcatt aggcacccca aggctttaca ctttatgctt 5460

ccgacctgga agaacctgac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct 5520

atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga 5580

taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc 5640

cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg 5700

aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc 5760

aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact 5820

tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc 5880

gggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5940

gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6000

taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6060

tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6120

agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatgcaac aacgttggcg 6180

caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6240

ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6300

tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 6360

agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 6420 

tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 6480 

agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 6540 

gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 6600 

gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 6660 

tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 6720 

gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6780 

accaaatact gttcttctag tgtagccgta gctaggccac cacttcaaga actctgtagc 6840 

accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6900 

gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 6960 

ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 7020 

atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 7080 

gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 7140 

cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 7200 

gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 7260 

gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 7320 

tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 7380 

cgagcgcagc gagtcagtga gcgaggaagc ggaaga 7416 



<210> 47 
<211> 726 
<212> DNA 
<213> Artificial 

<220> 

<223> Sequence of J96 fimC plus native signal sequence 
<400> 47 

atgagtaata aaaacgtcaa tgtaaggaaa tcgcaggaaa taacattctg cttgctggca 60 

ggtatcctga tgttcatggc aatgatggtt gccggacgcg ctgaagcggg agtggcctta 120 

ggtgcgactc gcgtaattta tccggcaggg caaaaacaag tgcaacttgc cgtgacaaat 180 

aatgatgaaa atagtaccta tttaattcaa tcatgggtgg aaaatgccga tggtgtaaag 240 

gatggtcgtt ttatcgtgac gcctcctctg tttgcgatga agggaaaaaa agagaatacc 300

ttacgtattc ttgatgcaac aaataaccaa ttgccacagg accgggaaag tttattctgg 360

atgaacgtta aagcgattcc gtcaatggat aaatcaaaat tgactgagaa tacgctacag 420 

ctcgcaatta tcagccgcat taaactgtac tatcgcccgg ctaaattagc gttgccaccc 480 

gatcaggccg cagaaaaatt aagatttcgt cgtagcgcga attctctgac gctgattaac 540 

ccgacaccct attacctgac ggtaacagag ttgaatgccg gaacccgggt tcttgaaaat 600 

gcattggtgc ctccaatggg cgaaagcacg gttaaattgc cttctgatgc aggaagcaat 660 

attacttacc gaacaataaa tgattatggc gcacttaccc ccaaaatgac gggcgtaatg 720 

gaataa 726 



<210> 48 

<211> 903 

<212> DNA 

<213> Artificial 

<220> 

<223> Sequence J96 fimH plus native signal sequence 

<400> 48

atgaaacgag ttattaccct gtttgctgta ctgctgatgg gctggtcggt aaatgcctgg 60

tcattcgcct gtaaaaccgc caatggtacc gctatcccta ttggcggtgg cagcgccaat 120

gtttatgtaa accttgcgcc cgtcgtgaat gtggggcaaa acctggtcgt ggatctttcg 180

acgcaaatct tttgccataa cgattatccg gaaaccatta cagactatgt cacactgcaa 240

cgaggctcgg cttatggcgg cgtgttatct aatttttccg ggaccgtaaa atatagtggc 300 

agtagctatc catttcctac caccagcgaa acgccgcgcg ttgtttataa ttcgagaacg 360 

gataagccgt ggccggtggc gctttatttg acgcctgtga gcagtgcggg cggggtggcg 420 

attaaagctg gctcattaat tgccgtgctt attttgcgac agaccaacaa ctataacagc 480 

gatgatttcc agtttgtgtg gaatatttac gccaataatg atgtggtggt gcctactggc 540 

ggctgcgatg tttctgctcg tgatgtcacc gttactctgc cggactaccc tggttcagtg 600 

ccaattcctc ttaccgttta ttgtgcgaaa agccaaaacc tggggtatta cctctccggc 660 

acaaccgcag atgcgggcaa ctcgattttc accaataccg cgtcgttttc acctgcacag 720 

ggcgtcggcg tacagttgac gcgcaacggt acgattattc cagcgaataa cacggtatcg 780 

ttaggagcag tagggacttc ggcggtgagt ctgggattaa cggcaaatta tgcacgtacc 840 

ggagggcagg tgactgcagg gaatgtgcaa tcgattattg gcgtgacttt tgtttatcaa 900 
taa 903



<210> 49 
<211> 814 
<212> DNA 
<213> Artificial 

<220> 

<223> Sequence of kanamycin R gene 
<400> 49 

atgagccata ttcaacggga aacgtcttgc tctaggccgc gattaaattc caacatggat 60 

gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 120 

tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg cacaggtagc 180 

gctgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 240 

cttcgaccat caaccatttt atccgtactc ctgatgatgc atggttactc accactgcga 300 

tccccggaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 360 

tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 420 
taacagcgat cgcgtatttc gtctcgctca ggcgcaatca ccaatgaata acggtttggt 480 

tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 540 

aatgcataaa cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 600 

taataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 660 

aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 720 

ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 780 

gcagtttcat ttgatgctcg atgagttttt ctaa 814 



<210> 50 

<211> 1085 

<212> DNA 

<213> Artificial 

<220> 

<223> Sequence of Lac IQ
<400> 50 

atgtgaaacc agtaacgtta tacgatgtcg cagagtatgc cggtgtctct tatcagaccg 60

tttcccgcgt ggtgaaccag gccagccacg tttctgcgaa aacgcgggaa aaagtggaag 120

cggcgatggc ggagctgaat tacattccca accgcgtggc acaacaactg gcgggcaaac 180

agtcgttgct gattggcgtt gccacctcca gtctggccct gcacgcgccg tcgcaaattg 240 

tcgcggcgat taaatctcgc gccgatcaac tgggtgccag cgtggtggtg tcgatggtag 300 

aacgaagcgg cgtcgaagcc tgtaaagcgg cggtgcacaa tcttctcgcg caacgcgtca 360 

gtgggctgat cattaactat ccgctggatg accaggatgc cattgctgtg gaagctgcct 420 

gcactaatgt tccggcgtta tttcttgatg tctctgacca gacacccatc aacagtatta 480 

ttttctccca tgaagacggt acgcgactgg gcgtggagca tctggtcgca ttgggtcacc 540 

agcaaatcgc gctgttagcg ggcccattaa gttctgtctc ggcgcgtctg cgtctggctg 600 

gctggcataa atatctcact cgcaatcaaa ttcagccgat agcggaacgg gaaggcgact 660 

ggagtgccat gtccggtttt caacaaacca tgcaaatgct gaatgagggc atcgttccca 720 

ctgcgatgct ggttgccaac gatcagatgg cgctgggcgc aatgcgcgcc attaccgagt 780 

ccgggctgcg cgttggtgcg gatatctcgg tagtgggata cgacgatacc gaagacagct 840 

catgttatat cccgccgtta accaccatca aacaggattt tcgcctgctg gggcaaacca 900 

gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc 960 

ccgtctcact ggtgaaaaga aaaaccaccc tggcgcccaa tacgcaaacc gcctctcccc 1020 

gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 1080 

agtga 1085 



<210> 51 
<211> 862 
<212> DNA 

<213> Sequence of beta-lactamase gene
<400> 51 

atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 60 

gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 120 

cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 180 

gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc 240 

cgtattgacg ccgggcaaga gcaactcggg tcgccgcata cactattctc agaatgactt 300 

ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 360

atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 420 

cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 480 

tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 540 

gcctgtagca atgcaacaac gttggcgcaa actattaact ggcgaactac ttactctagc 600 

ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 660 

ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 720 

tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 780 

cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 840 

ctcactgatt aagcattggt aa 862 



<210> 52

<211> 601

<212> DNA

<213> Artificial

<220>
<223> Sequence of the origin of replication

<400> 52

tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt 60

gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc 120

agataccaaa tactgttctt ctagtgtagc cgtagctagg ccaccacttc aagaactctg 180

tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg 240

ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 300 

cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac 360

tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg 420 

acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg 480 

gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat 540 

ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt 600 

t 601



<210> 53 

<211> 116 

<212> DNA 

<213> Sequence of Lac p/o



<400> 53 

cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 60 
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct 116 



<210> 54 

<211> 837 

<212> DNA 

<213> E. coli 



<400> 54 

ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 60

tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120

caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180

ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata taatggcagt 240

agctatccat ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 300

aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360

aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420

gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480

tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540

attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600

accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc tgcacagggc 660

gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720

ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattatgc acgtaccgga 780

gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837



<210> 55 

<211> 279 

<212> PRT 

<213> Artificial 



<220> 

<223> Consensus sequence of FimH proteins for SEQ ID NO: 23 to 45 
<400> 55 

Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala Ile Pro Ile Gly Gly Gly
1 5 10 15

Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala Val Asn Val Gly Gln
20 25 30

Asn Leu Val Val Asp Leu Ser Thr Gln Ile Phe Cys His Asn Asp Tyr
35 40 45

Pro Glu Thr Ile Thr Asp Tyr Val Thr Leu Gln Arg Gly Ser Ala Tyr
50 55 60

Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val Lys Tyr Ser Gly Ser
65 70 75 80

Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro Arg Val Val Tyr Asn
85 90 95

Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu Tyr Leu Thr Pro Val
100 105 110

Ser Ser Ala Gly Gly Val Ala Ile Lys Ala Gly Ser Leu Ile Ala Val
115 120 125

Leu Ile Leu Arg Gln Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gln Phe
130 135 140

Val Trp Asn Ile Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly Gly
145 150 155 160

Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr Arg
165 170 175

Gly Ser Val Pro Ile Pro Leu Thr Val Tyr Cys Ala Lys Ser Gln Asn
180 185 190

Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser Ile
195 200 205

Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gln Gly Val Gly Val Gln
210 215 220

Leu Thr Arg Asn Gly Thr Ile Ile Pro Ala Asn Asn Thr Val Ser Leu
225 230 235 240

Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn Tyr
245 250 255

Ala Arg Thr Gly Gly Gln Val Thr Ala Gly Asn Val Gln Ser Ile Ile
260 265 270

Gly Val Thr Phe Val Tyr Gln

<210> 56

<211> 55 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA1F 

<400> 56 

cctgccatgg cgggtgtggc gctgggtgcg acccgcgtga tttatccggc agggc 55 



<210> 57 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA1R 

<400> 57 

ggcgtcgaca gattctatta ttccattacg cccgtc 36 



<210> 58 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA13F 

<400> 58 

cacacaggaa acagctatga ttgtaatgaa aacgag 36 



<210> 59 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA6R



<400> 59 

ggcgtcgacg gatccttatt gataaacaaa agtcacgcc

<210> 60

<211> 30

<212> DNA

<213> Artificial

<220>

<223> Oligonucleotide primer GA11F






WO 02/04496 



<400> 60 

ccgaataaag atatcacgac aggtttcccg 



<210> 61 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA9R 

<400> 61 

catagctgtt tcctgtgtg 



<210> 62 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA24F 

<400> 62 

tgctcacatg ttctttcctg cgt 



<210> 63 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA23R 

<400> 63 

gacgttttta ttactcatag ctgtttcctg tgtg 



<210> 64 

<211> 37 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide primer GA21F 

<400> 64 

atgagtaata aaaacgtcaa tgtaaggaaa tcgcagg